Boosting SpLSA for Text Classification

Hurtado, Julio; Mendoza, Marcelo; Nanculef, Ricardo

Boosting SpLSA for Text Classification

dc.catalogador	grr
dc.contributor.author	Hurtado, Julio
dc.contributor.author	Mendoza, Marcelo
dc.contributor.author	Nanculef, Ricardo
dc.date.accessioned	2024-05-28T21:12:58Z
dc.date.available	2024-05-28T21:12:58Z
dc.date.issued	2017
dc.description.abstract	Text classification is a challenge in document labeling tasks such as spam filtering and sentiment analysis. Due to the descriptive richness of generative approaches such as probabilistic Latent Semantic Analysis (pLSA), documents are often modeled using these kind of strategies. Recently, a supervised extension of pLSA (spLSA [10]) has been proposed for human action recognition in the context of computer vision. In this paper we propose to extend spLSA to be used in text classification. We do this by introducing two extensions in spLSA: (a) Regularized spLSA, and (b) Label uncertainty in spLSA. We evaluate the proposal in spam filtering and sentiment analysis classification tasks. Experimental results show that spLSA outperforms pLSA in both tasks. In addition, our extensions favor fast convergence suggesting that the use of spLSA may reduce training time while achieving the same accuracy as more expensive methods such as sLDA or SVM.
dc.fuente.origen	Converis
dc.identifier.converisid	1
dc.identifier.doi	10.1007/978-3-319-52277-7_18
dc.identifier.eissn	1611-3349
dc.identifier.issn	0302-9743
dc.identifier.scopusid	SCOPUS_ID:2-s2.0-85013474291
dc.identifier.uri	https://repositorio.uc.cl/handle/11534/85937
dc.identifier.wosid	WOS:000418399200018
dc.language.iso	en
dc.nota.acceso	sin adjunto
dc.pagina.final	149
dc.pagina.inicio	142
dc.relation.ispartof	PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2016
dc.revista	Lecture Notes in Computer Science
dc.rights	acceso abierto
dc.title	Boosting SpLSA for Text Classification
dc.type	comunicación de congreso
dc.volumen	10125
sipa.trazabilidad	Converis;20-07-2021

Collections

Artículos de conferencia

Boosting SpLSA for Text Classification

Files

Collections