
The WebGenre Blog: The power of genre applied to digital information. By Marina Santini » Entries tagged with "Web Genre"

Towards a Computational Theory of Digital Genre (I): Working Definition of Genres for Computational Purposes

by Marina Santini – Last Updated: 29 Oct 2012

1. What is a (textual) genre?
• A genre is a class of texts with similar communicative, textual and linguistic features.

2. What characterizes a genre? A genre:
• Must have a name
• Must be recognized within a community
• Must be produced or retrieved during a task
• Must have conventions
• Must raise expectations
• Can change over time. It is a cultural artifact (culture here includes society, media, technology, etc.)

3. What characterizes a digital genre?
• The same characteristics listed above.
• A digital genre is any kind of genre that has a digital form, such as emails, chats, online academic papers, online newspaper articles, blogs…
• A digital genre can be any paper genre … Read entire article »

Filed under: dialectic, discussions, dissemination, reflections

Book Outline: Automatic Identification of Genre in Web Pages (2011)

Automatic Identification of Genre in Web Pages: A new perspective [Paperback]
Marina Santini (Author)
Paperback: 332 pages
Publisher: LAP LAMBERT Academic Publishing (December 19, 2011)
Language: English
ISBN-10: 3847306871
ISBN-13: 978-3847306870

Book Overview
This book is divided into five parts: a preliminary part (Part I), three empirical parts (Parts II, III and IV) and an epilogue (Part V). … Read entire article »

Filed under: overviews

Web Genre and the Emergent Properties of Web Sites

Post signed by: Maya Dimitrova, Institute of Control and System Research, Bulgarian Academy of Sciences
Previous posts: Gestalt Processes in User-Web Interaction: A Two-Side View & User-Web Interaction: Gestalt in Information Retrieval

Maya uses Gestalt theory to explain the development of the Web as a social medium and the emergence of Web genres, starting from a psychological perspective.

4 User Account of the Emergent Properties of Web Sites
4.1 Holistic understanding of aspects of Web genre
The studies on Web genre discussed so far assume that genre components have a multidimensional structure. For a multidimensional structure, however, it is important to test the orthogonality assumption. The main aim of our studies has been to see whether users perceive these dimensions as independent, so that they can be meaningfully displayed as … Read entire article »

Filed under: collaborative blogging, discussions, signed posts

Towards Language–Independent Web Genre Detection (2009)

Poster paper by: Philipp Scholl, Renato Domínguez García, Doreen Böhnstedt, Christoph Rensing, Ralf Steinmetz

The term web genre denotes the type of a given web resource, in contrast to the topic of its content. In this research, we focus on recognizing the web genres blog, wiki and forum. We present a set of features that exploit the hierarchical structure of the web page’s HTML mark-up and thus, in contrast to related approaches, do not depend on a linguistic analysis of the page’s content. Our results show that it is possible to achieve very good accuracy for a fully language-independent detection of structured web genres. … Read entire article »

Filed under: abstracts
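
To make the idea of structure-only features in the abstract above more concrete, here is a minimal Python sketch. It does not reproduce the feature set of the poster paper, which is not given in this excerpt. The class name StructureProfiler, the function structural_features, and the four features it returns are hypothetical choices made for illustration. The sketch only shows that statistics such as nesting depth and tag ratios can be computed from the mark-up without reading any of the page's text.

```python
from collections import Counter
from html.parser import HTMLParser

# Void elements never have a closing tag, so they must not change the depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class StructureProfiler(HTMLParser):
    """Hypothetical helper: collects statistics about HTML nesting only."""

    def __init__(self):
        super().__init__()
        self.tag_counts = Counter()
        self.depth = 0
        self.max_depth = 0

    def handle_starttag(self, tag, attrs):
        self.tag_counts[tag] += 1
        if tag not in VOID_TAGS:
            self.depth += 1
            self.max_depth = max(self.max_depth, self.depth)

    def handle_endtag(self, tag):
        # Simplification: assumes reasonably well-formed mark-up; omitted
        # optional end tags (e.g. a missing </li>) would inflate the depth.
        if tag not in VOID_TAGS and self.depth > 0:
            self.depth -= 1


def structural_features(html):
    """Return illustrative features that ignore the page's text entirely."""
    profiler = StructureProfiler()
    profiler.feed(html)
    profiler.close()
    total = sum(profiler.tag_counts.values())
    return {
        "total_tags": total,
        "max_depth": profiler.max_depth,
        "link_ratio": profiler.tag_counts["a"] / total if total else 0.0,
        "list_item_ratio": profiler.tag_counts["li"] / total if total else 0.0,
    }


if __name__ == "__main__":
    page = "<html><body><ul><li><a href='x'>t</a></li><li>u</li></ul><br></body></html>"
    print(structural_features(page))
```

A feature dictionary like this could be passed to any standard classifier. Which features actually separate blogs, wikis and forums is an empirical question that the sketch does not answer.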

Abstract: Mining Graph Patterns in Web-based Systems: A Conceptual View

by Matthias Dehmer and Frank Emmert-Streib
In: Genres on the Web: Computational Models and Empirical Studies
Alexander Mehler, Serge Sharoff and Marina Santini
Text, Speech and Language Technology, Volume 42, 2011, DOI: 10.1007/978-90-481-9178-9

Abstract
This chapter discusses a graph-based perspective for automatically analyzing web genre data by mining graph patterns representing web-based hypertext structures. The major purpose of our contribution is to emphasize that an approach entirely different to the vector space model, frequently used in Web mining and related problems, can not only be applied to these problems but is more suitable conceptually. The graphs in our study are hierarchical and directed and are called generalized trees. Starting from a similarity measure for determining the structural similarity of generalized trees, we discuss some evaluation steps for automatically … Read entire article »

Filed under: abstracts