
ADAPTING TO CHANGE: ENSURING ROBUST TEXT CLASSIFICATION IN DYNAMIC ENVIRONMENTS THROUGH CONFOUNDING SHIFT NAVIGATION
Aron Landeiro , Department of Mathematics and Computer Science University of Wroclaw, PolandAbstract
This study addresses the challenge of maintaining robust text classification in dynamic environments characterized by confounding shifts. As information landscapes evolve, the performance of text classification models can be compromised due to changes in data distributions. Our research introduces a novel approach for adapting to these shifts, emphasizing the importance of confounding shift navigation. By employing advanced techniques, our methodology enhances the resilience and accuracy of text classifiers in the face of evolving contexts, ensuring their effectiveness over time. This paper explores the theoretical foundations and practical implications of our approach, offering valuable insights for applications requiring enduring text classification performance.
Keywords
Confounding Shifts, Text Classification, Dynamic Environments
References
Angrist, J. D., & Pischke, J.-S. (2008).Mostly harmless econometrics: An empiricist’scompanion. Princeton university press.
Bareinboim, E., Tian, J., & Pearl, J. (2014). Recovering from selection bias in causal andstatistical inference. InProceedings of The Twenty-Eighth Conference on ArtificialIntelligence (CE Brodley and P. Stone, eds.). AAAI Press, Menlo Park, CA.
Bickel, S., Br ̈uckner, M., & Scheffer, T. (2009). Discriminative learning under covariateshift.Journal of Machine Learning Research,10(Sep), 2137–2155.
Chen, X., Monfort, M., Liu, A., & Ziebart, B. D. (2016). Robust covariate shift regression.InArtificial Intelligence and Statistics, pp. 1270–1279.
Chesher, A. (1991). The effect of measurement error.Biometrika,78(3), 451–462.
Dahll ̈of, M. (2012). Automatic prediction of gender, political affiliation, and age in swedishpoliticians from the wording of their speeches - A comparative study of classifiability.LLC,27(2), 139–153.
De Choudhury, M., Kiciman, E., Dredze, M., Coppersmith, G., & Kumar, M. (2016). Dis-covering shifts to suicidal ideation from mental health content in social media. InProceedings of the 2016 CHI Conference on Human Factors in Computing Systems,pp. 2098–2110. ACM.
Dehejia, R. H., & Wahba, S. (2002). Propensity score-matching methods for nonexperimen-tal causal studies.Review of Economics and statistics,84(1), 151–161.
Gelman, A., & Hill, J. (2006).Data analysis using regression and multilevel/hierarchicalmodels. Cambridge university press.
Hajian, S., & Domingo-Ferrer, J. (2013). A methodology for direct and indirect discrimina-tion prevention in data mining.Knowledge and Data Engineering, IEEE Transactionson,25(7), 1445–1459.
Hemphill, L., Culotta, A., & Heston, M. (2016). #polar scores: Measuring partisanshipusing social media content.Journal of Information Technology & Politics,0(ja).
Article Statistics
Downloads
Copyright License
Copyright (c) 2024 Aron Landeiro

This work is licensed under a Creative Commons Attribution 4.0 International License.