Articles Comments

The WebGenre Blog: The power of genre applied to digital information. By Marina Santini » Entries tagged with "information retrieval"

Opinion Retrieval and Ranking: the creeping and ineluctable force of Genre

Last Updated: 27 May 2013 Two fundamental principles concurring to the definition and characterization of the concept of genre are conventions and expectations. Simply put, in textual (written or spoken) communication, genres are words that connote different types of text. For instance, on the web the home page genre is different from the blog genre; in a company, the minutes genre is different from the white paper genre; in the press the leader genre is different from the letter to the editor genre… Genres have the power of shaping information following rhetorical and discourse patterns that have become conventionalized. Genre conventions are implemented by the writer(s). When acknowledged, genre conventions raise predictable expectations in the readers or more generally in those who “process” a text… Although I am oversimplifying here, broadly speaking … Read entire article »

Filed under: discussions, quotes, reflections

Thesis Review: Resolving Power of Search Keys

Heppin, Karin Friberg (2010). Resolving Power of Search Keys in MedEval a Swedish Medical Text Collection with User Groups: Doctors and Patients. PhD thesis, Gothenburg University, Sweden Thesis: http://www2.gslt.hum.gu.se/dissertations/friberg.pdf Errata: http://www2.gslt.hum.gu.se/dissertations/friberg.pdf Opponent Stefan Schulz; Defence Presentation: http://user.meduni-graz.at/stefan.schulz/presentations/2010_Gothenburg_Defence.pptx The thesis “Resolving Power of Search Keys in MedEval a Swedish Medical Text Collection with User Groups: Doctors and Patients” opens with crucial questions in Information Retrieval (IR). The general question is: 1. What type of search keys are effective when searching for information in a collection of documents? Language-specific questions refer to how to handle compounds, since around 10%2 of words in Swedish running texts are compounds Then, important questions are: 2. What is the best way to treat compounds? 3. When is it beneficial to use individual compound constituents as search keys and when does it ruin a search? The thesis … Read entire article »

Filed under: reviews

Reblogging: Informer, Spring Issue

Informer Newsletter of the BCS Information Retrieval Specialist Group http://irsg.bcs.org/informer/ Table of Contents Editorial: By Udo Kruschwitz on April 28, 2012 Conference Review: ECIR 2012 Industry Day: By Franco Maria Nardini on April 26, 2012 Book Review: Search Analytics for Your Site: By Tyler Tate on April 26, 2012 Conference review: ECIR 2012: By Claudia Hauff on April 25, 2012 Call for Book Reviews: By Cathal Gurrin on April 25, 2012 Conference Review: ECIR 2011: By Cathal Gurrin on April 18, 2012 The Information Needs of Mobile Searchers: By Tyler Tate on April 6, 2012 Designing Faceted Search: Getting the basics right (pt 2): By Tony Russell-Rose on April 4, 2012 Events spring 2012: By Andy Macfarlane on March 30, 2012 … Read entire article »

Filed under: dissemination, reading suggestions, reblogging

Dissemination: Acknowledgement Search Engine and Next Generation Search Engines

1)  AckSeer is a beta automatic acknowledgment indexing search engine that explores automatic identification, entity extraction and indexing of acknowledgements from papers. In addition acknowledged entities are extracted within the acknowledgment passages. Currently, AckSeer indexes acknowledgments from more than 500,000 papers in CiteSeerX. These acknowledgements contain more than 4 million acknowledged entities with approximately 2 million of them unique. Entity extraction is based on AlchemyAPI and OpenCalais. Acknowledged entities are ranked by citation. Feedback is most welcomed. http://ackseer.ist.psu.edu 2) Next Generation Search Engines: Advanced Models for Information Retrieval © 2012; Publication Date: March 2012; 560 pages ISBN: 978-1-4666-0330-1; EISBN: 978-1-4666-0331-8 Published by IGI Publishing, Hershey-New York, USA http://www.igi-global.com/book/next-generation-search-engines/59723 … Read entire article »

Filed under: dissemination

User-Web Interaction: Gestalt in Information Retrieval

Post signed by: Maya Dimitrova, Institute of Control and System Research, Bulgarian Academy of Sciences * In this post, all references, figures and tables have been removed by the blog’s moderator. [Part II] 3   Gestalt in Information Retrieval A group of information retrieval studies is concerned with identifying new linguistic, lexical or formal features (like the special tags) that can be captured by automatically processing html scripts – scanning, tokenizing, clustering – and extracting meaningful information to identify the style or genre of the text inside the Web page. Web genre in the discussed group of studies is defined as a multi-dimensional structure of features of text and html design pointing out at various linguistic and cognitive aspects of the retrieved Web document to help the user find not just the relevant topic, but … Read entire article »

Filed under: collaborative blogging, dialectic, discussions, signed posts

Abstract: Marrying Relevance and Genre Rankings: an Exploratory Study

Marrying Relevance and Genre Rankings: an Exploratory Study by Pavel Braslavski Amazon.com Widgets Amazon.co.uk Widgets In: Genres on the Web Computational Models and Empirical Studies Alexander Mehler, Serge Sharoff and Marina Santini Text, Speech and Language Technology Volume 42, 2011, DOI: 10.1007/978-90-481-9178-9 Abstract In this chapter, we discuss different options for using genre-related information inWeb search. We conduct an experiment on merging genre-related and text-relevance rankings using a reference Web collection. A method for automatic extraction of formality score akin to readability score using canonical discriminant analysis applied to a sample of genres with decreasing formality is proposed. Effects of aggregating genre-related and text relevance rankings are considered. Evaluation of the results shows moderate positive effects. Findings suggest that further research is needed on implicit use of genre-related information in Web search. … Read entire article »

Filed under: abstracts

Abstract: Web Genre Analysis: Use Cases, Retrieval Models, and Implementation Issues

Web Genre Analysis: Use Cases, Retrieval Models, and Implementation Issues by Benno Stein, Sven Meyer zu Eissen and Nedim Lipka Amazon.com Widgets Amazon.co.uk Widgets In: Genres on the Web Computational Models and Empirical Studies Alexander Mehler, Serge Sharoff and Marina Santini Text, Speech and Language Technology Volume 42, 2011, DOI: 10.1007/978-90-481-9178-9 Abstract People who search the World Wide Web often have a multi-faceted understanding of their information need: they know what they are searching for, and they know of which form or type the desired documents should be. The former aspect relates to the content of a desired document (= topic), the latter to the presentation of its content and the intended target group. … Read entire article »

Filed under: abstracts

Chapter Excerpt: Riding the Rough Waves of Genre on the Web

Riding the Rough Waves of Genre on the Web Concepts and Research Questions Marina Santini, Alexander Mehler, Serge Sharoff In: Genres on the Web Computational Models and Empirical Studies Alexander Mehler, Serge Sharoff and Marina Santini Text, Speech and Language Technology Volume 42, 2011, DOI: 10.1007/978-90-481-9178-9 1 Why is Genre Important? Genre, in the most generic definition, takes the meaning “kind; sort; style” (OED). A more specialised definition of genre in OED reads: “A particular style or category of works of art; esp. a type of literary work characterised by a particular form, style, or purpose.”. Similar definitions are found in other dictionaries, for instance, OALD reads “a particular type or style of literature, art, film or music that you can recognise because of its special features”. Broadly speaking, then, generalising from lexicographic definitions, genre can be seen … Read entire article »

Filed under: chapter excerpts