Boosting SpLSA for Text Classification
dc.catalogador | grr | |
dc.contributor.author | Hurtado, Julio | |
dc.contributor.author | Mendoza, Marcelo | |
dc.contributor.author | Nanculef, Ricardo | |
dc.date.accessioned | 2024-05-28T21:12:58Z | |
dc.date.available | 2024-05-28T21:12:58Z | |
dc.date.issued | 2017 | |
dc.description.abstract | Text classification is a challenge in document labeling tasks such as spam filtering and sentiment analysis. Due to the descriptive richness of generative approaches such as probabilistic Latent Semantic Analysis (pLSA), documents are often modeled using these kind of strategies. Recently, a supervised extension of pLSA (spLSA [10]) has been proposed for human action recognition in the context of computer vision. In this paper we propose to extend spLSA to be used in text classification. We do this by introducing two extensions in spLSA: (a) Regularized spLSA, and (b) Label uncertainty in spLSA. We evaluate the proposal in spam filtering and sentiment analysis classification tasks. Experimental results show that spLSA outperforms pLSA in both tasks. In addition, our extensions favor fast convergence suggesting that the use of spLSA may reduce training time while achieving the same accuracy as more expensive methods such as sLDA or SVM. | |
dc.fuente.origen | Converis | |
dc.identifier.converisid | 1 | |
dc.identifier.doi | 10.1007/978-3-319-52277-7_18 | |
dc.identifier.eissn | 1611-3349 | |
dc.identifier.issn | 0302-9743 | |
dc.identifier.scopusid | SCOPUS_ID:2-s2.0-85013474291 | |
dc.identifier.uri | https://repositorio.uc.cl/handle/11534/85937 | |
dc.identifier.wosid | WOS:000418399200018 | |
dc.language.iso | en | |
dc.nota.acceso | sin adjunto | |
dc.pagina.final | 149 | |
dc.pagina.inicio | 142 | |
dc.relation.ispartof | PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2016 | |
dc.revista | Lecture Notes in Computer Science | |
dc.rights | acceso abierto | |
dc.title | Boosting SpLSA for Text Classification | |
dc.type | comunicación de congreso | |
dc.volumen | 10125 | |
sipa.trazabilidad | Converis;20-07-2021 |