Articles Comments

The WebGenre Blog: The power of genre applied to digital information. By Marina Santini » Archive

Impact of Sociolinguistics in Opinion Mining Systems

Signed post by Alexander Osherenko, Socioware Development, osherenko@socioware.de Full paper: Considering Impact of Sociolinguistic Findings in Believable Opinion Mining Systems Proceedings of The Fifth International Conference On Cognitive Science. 2012. Kalinigrad, Russia (http://www.informatik.uni-augsburg.de/~osherenk/final_kalinigrad.pdf) Opinions are frequent means of communication in human society and automatic approaches to opinion mining in texts attracted therefore much attention. All in all, most approaches apply data mining techniques and extract lexical features (words) as reliable means of classi cation. Noteworthy that although the interest in opinion mining is huge, there are only few explorations on words extracted in opinion mining. This study considers this drawback and elaborates on a sociolinguistic explanation. We hypothesize: an opinion mining system should be trained for classifying opinions in texts of the same language style. Hence, this contribution focuses on the following questions: 1) do sociolinguistic … Read entire article »

Filed under: collaborative blogging, computational models, dialectic, discussions, dissemination, featured, reading suggestions, signed posts

White Paper: Automatic Genre Identification – Testing with Noise

Automatic Genre Identification – Testing with Noise by Efstathios Stamatatos, Serge Sharoff, Marina Santini – Copyright © 2012, All rights reserved.   Citation:  Stamatatos E., Sharoff S., Santini M. (2012). Automatic Genre Identification – Testing with Noise. [White paper]. Retrieved from http://www.forum.santini.se/2012/03/white-paper-automatic-genre-identification-testing-with-noise/ The genre collections used in the experiments are available here. The reference list is here. In the experiments described below, genre classes coming from three genre collections have been used: Santinis7 (Santini, 2007). KI-04 (Meyer zu Eissen and Stein, 2004), and HGC (Stubbe and Ringlstetter, 2007). These genre collections have been created by different people, in different universities, for different purposes, with different criteria, and different notions of what genre is. Since genre is a complex concept and genre classes can be characterized in different ways, we assume that having a AGI algorithm … Read entire article »

Filed under: collaborative blogging, computational models, featured, signed posts, white papers

Question: How do you inject emotions into chatbots, Andres?

Post signed by: ANDRES TOMÁS HOHENDAHL, NLP Researcher http://web.fi.uba.ar/~ahohenda Injecting emotions is a rather complicated task, so I started by using lots of heuristics, classifying the user inputs and the bot answers, trying to figure out emotional communication and frustration states, as the emotional state is a result of several ongoing acts, and somehow measures or emotional readouts of the result of those interactive acts. In other words if the bot gets a question, and knows how to answer it, he gets happier, if the user denies or objects the answer, he gets a little frustrated, if the user says nonsenses (not understandable things) the bot gets curious and asks further, if the user employs harsh terms, he might go into a defensive-first and angry after state, all was based on the … Read entire article »

Filed under: collaborative blogging, queries, signed posts

Provocation or Food for thought? Web sites as AI Mind Persons

Post signed by: Arthur T. Murray, independent scholar in artificial intelligence One difference between natural and artificial intelligence is that a human being can have a webpage, while an artificial intelligence can be a webpage. There are potentially many genres of artificial intelligence, such as robot AI embodied in a robot; Web-resident AI surviving on a server with or without robotic embodiment; and cyborg AI as a cybernetic organism sharing “meatspace” with a host substrate. Let us focus on the Web-resident AI Mind that both has and is a webpage. … Read entire article »

Filed under: collaborative blogging, dialectic, discussions, signed posts

Web Genre and the Emergent Properties of Web Sites

Post signed by: Maya Dimitrova, Institute of Control and System Research, Bulgarian Academy of Sciences Previous posts: Gestalt Processes in User-Web Interaction: A Two-Side View & User-Web Interaction: Gestalt in Information Retrieval Maya tries to explain the process of development of the Web as a social medium and the emergence of Web genres with the Gestalt theory, explaining it first from a psychological perspective. 4   User Account of the Emergent Properties of Web Sites 4.1 Holistic understanding of aspects of Web genre In the discussed studies on Web genre a multidimensional structure of the genre components is assumed. For a multidimensional structure, however, it is important to test the orthogonality assumption. The main hypothesis in our studies has been to see if users perceive these dimensions as independent, so that they can be meaningfully displayed as … Read entire article »

Filed under: collaborative blogging, discussions, signed posts

User-Web Interaction: Gestalt in Information Retrieval

Post signed by: Maya Dimitrova, Institute of Control and System Research, Bulgarian Academy of Sciences * In this post, all references, figures and tables have been removed by the blog’s moderator. [Part II] 3   Gestalt in Information Retrieval A group of information retrieval studies is concerned with identifying new linguistic, lexical or formal features (like the special tags) that can be captured by automatically processing html scripts – scanning, tokenizing, clustering – and extracting meaningful information to identify the style or genre of the text inside the Web page. Web genre in the discussed group of studies is defined as a multi-dimensional structure of features of text and html design pointing out at various linguistic and cognitive aspects of the retrieved Web document to help the user find not just the relevant topic, but … Read entire article »

Filed under: collaborative blogging, dialectic, discussions, signed posts

Gestalt Processes in User-Web Interaction: A Two-Side View

Post signed by: Maya Dimitrova, Institute of Control and System Research, Bulgarian Academy of Sciences * In this post, all references, figures and tables have been removed by the blog’s moderator. [Part I] 1. Introduction The Web is developing adaptively and displays certain natural processes that we find also in other areas, for example in the evolution and synthesis of knowledge through scientific research. This process fits very well with the implicit nature of human learning and with our natural human ability to synthesize and systematize new information, even in the face of the exponential growth on the Web. Therefore it is not the amount of new information on the Web that is troubling, but rather the lack of hints about the nature of the contents behind the lists returned by search engines and from the … Read entire article »

Filed under: collaborative blogging, dialectic, discussions, signed posts

Mining genres with lexical affect sensing?

Post signed by: Alexander Osherenko, University of Augsburg I gained a comprehensive knowledge in emotion recognition in texts in my PhD thesis “Opinion mining and lexical affect sensing” (http://www.informatik.uni-augsburg.de/~osherenk/promotionsvortrag_english.pdf). In my opinion, this knowledge can be utilized for identifying genres of texts — I don’t think identifying emotions differs much from identifying genres. … Read entire article »

Filed under: computational models, signed posts

Book Review: Genres on the Web (Mehler et al. 2010)

Post signed by: Michael Oakes, University of Sunderland and Uni Research, Bergen. Excerpts from the book review: Alexander Mehler, Serge Sharoff and Marina Santini (eds), Genres on the Web: Computational Models and Empirical Studies. Springer, 2010, 362 pp. Language Resources and Evaluation, Volume 45 / 2011 Preview: http://www.springerlink.com/content/ym07440380524721/ This comprehensive book makes many original contributions to the field of genres on the web. The identification and characterization of genres is of obvious interest to “pure” linguistics, but as this book makes clear, there are some important practical applications. Chief amongst these will be the advent of genre-aware search engines, where users will be able to specify not only their topics of interest, but the desired genre of the returned web pages, as in the WEGA search engine described in this book by Stein, Meyer … Read entire article »

Filed under: reviews, signed posts