Cross-Cultural Sentiment Analysis of User Texts on Twitter

Svetlana S. Bodrunova

Doctor of Political Science, Professor at the Chair of Management in Mass Communications, School of Journalism and Mass Communications, St. Petersburg State University, Saint Petersburg, Russia

e-mail: s.bodrunova@spbu.ru

Section: New Media

The paper reviews today’s most successful approaches to sentiment analysis of massive datasets of user-generated texts, including those from Twitter. We identify the most developed areas of sentiment studies and their limitations, as well as the methodological, technological, and other challenges that sentiment analysis faces in its variations across the world. We also group the existing research into clusters based on several criteria, including the presence or absence of machine learning, the unit of analysis, and the object of study. We show that creating cross-cultural multilingual sentiment analysis, and the tools for it, is a major task facing today’s sentiment studies; such tools would allow detecting sentiment across a range of languages and cultures. We assess the existing tools for multilingual sentiment analysis and suggest a conceptual framework for future studies of sentiment in different language domains of Twitter.

Keywords: sentiment analysis, sentiment, Twitter, computational social science, cross-cultural sentiment analysis
DOI: 10.30547/vestnik.journ.6.2018.191212
