
The WebGenre Blog: The power of genre applied to digital information. By Marina Santini » Entries tagged with "search"

Enterprise Search has a very bright future!

Last updated (Comments): 10 July 2013 On 30 May 2013, I attended the Findability Day 2013 (findabilityday2013-esli.eventbrite.com) organized by Findwise (www.findwise.com). The gathering of about 200 participants took place in Central Stockholm (Odenplan) on a sunny day, in bright and spacious conference rooms, and in a friendly and laid-back atmosphere. The event – “the biggest event on search and findability in Northern Europe”, as the subtitle says – was free of charge (only registration was required) and was sponsored by Google and Splunk. I will not give a complete debrief of the Findability Day 2013 in this post. Martin White has summarized the highlights on his blog (http://www.intranetfocus.com/?p=1295), and Olof Belfrage describes the presentations in more detail in a post (http://www.findwise.com/blog/impressions-from-findability-day-2013/) published on the Findwise blog. In this post I would like to summarize a … Read entire article »

Filed under: reflections, reports

Thesis Review: Resolving Power of Search Keys

Heppin, Karin Friberg (2010). Resolving Power of Search Keys in MedEval a Swedish Medical Text Collection with User Groups: Doctors and Patients. PhD thesis, Gothenburg University, Sweden. Thesis: http://www2.gslt.hum.gu.se/dissertations/friberg.pdf Errata: http://www2.gslt.hum.gu.se/dissertations/friberg.pdf Opponent: Stefan Schulz; Defence Presentation: http://user.meduni-graz.at/stefan.schulz/presentations/2010_Gothenburg_Defence.pptx The thesis “Resolving Power of Search Keys in MedEval a Swedish Medical Text Collection with User Groups: Doctors and Patients” opens with crucial questions in Information Retrieval (IR). The general question is: 1. What types of search keys are effective when searching for information in a collection of documents? Language-specific questions concern how to handle compounds, since around 10% of the words in Swedish running text are compounds. The important questions are then: 2. What is the best way to treat compounds? 3. When is it beneficial to use individual compound constituents as search keys, and when does it ruin a search? The thesis … Read entire article »

Filed under: reviews

SearchInFocus: Exploratory Study on Query Logs and Actionable Intelligence

Exploratory Query-log Analysis Workshop Thursday, October 25, 2012 from 9:00 AM to 12:00 PM (CEST) Lund, Sweden Organized by Findwise at the Fourth Swedish Language Technology Conference, October 24-26, 2012, Lund. For more information: SLTC 2012. Workshop Description Query logs have mostly been the focus of academic workshops and conferences. At the same time, with the upsurge in high-quality search and user-interaction solutions in the enterprise world, companies have ready access to (very often) large amounts of log data. This data, however, whose core is often natural language queries, remains largely unexplored and neglected. The aim of this workshop is to present best practices and standards, new trends, and techniques for analyzing the data, drawing conclusions from it, and putting it to use. The workshop is meant as a forum where people … Read entire article »

Filed under: slides

Beyond Topic: Genre and Search

One of the central problems of information retrieval (IR) is the difficulty of matching a document to a query in the absence of any contextual information about the searchers and the document creators. Genre is a context carrier, and information systems can exploit genre information to improve their matching algorithms. The web hosts many recognised genres that can potentially provide this contextual information, including FAQs, press releases, product descriptions, instructions, guides, and reviews. … Read entire article »

Filed under: dialectic, discussions, featured, reading suggestions, references

Abstract: Marrying Relevance and Genre Rankings: an Exploratory Study

Marrying Relevance and Genre Rankings: an Exploratory Study by Pavel Braslavski In: Genres on the Web: Computational Models and Empirical Studies, Alexander Mehler, Serge Sharoff and Marina Santini, Text, Speech and Language Technology, Volume 42, 2011, DOI: 10.1007/978-90-481-9178-9 Abstract In this chapter, we discuss different options for using genre-related information in Web search. We conduct an experiment on merging genre-related and text-relevance rankings using a reference Web collection. A method for automatic extraction of formality score akin to readability score using canonical discriminant analysis applied to a sample of genres with decreasing formality is proposed. Effects of aggregating genre-related and text relevance rankings are considered. Evaluation of the results shows moderate positive effects. Findings suggest that further research is needed on implicit use of genre-related information in Web search. … Read entire article »
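
To give a rough idea of what merging a text-relevance ranking with a genre-related ranking can look like, here is a minimal sketch. It is not the method used in the chapter: the weighted linear combination of min-max normalised scores, the function name `merge_rankings`, the `weight` parameter and the example scores are all hypothetical and chosen only for illustration.

```python
def merge_rankings(relevance, genre, weight=0.7):
    """Order documents by a weighted sum of two normalised score sets.

    relevance, genre: dicts mapping a document id to a raw score
    (hypothetical inputs; higher means better).
    weight: share given to text relevance; the rest goes to genre.
    """
    def normalise(scores):
        # Min-max scaling to [0, 1] so the two score sets are comparable.
        if not scores:
            return {}
        lo, hi = min(scores.values()), max(scores.values())
        span = hi - lo
        return {d: (s - lo) / span if span else 0.0 for d, s in scores.items()}

    rel = normalise(relevance)
    gen = normalise(genre)
    # Only documents scored by both rankers can be combined.
    docs = rel.keys() & gen.keys()
    combined = {d: weight * rel[d] + (1 - weight) * gen[d] for d in docs}
    # Highest combined score first; ties broken by document id.
    return sorted(combined, key=lambda d: (-combined[d], d))


if __name__ == "__main__":
    relevance = {"a": 3.0, "b": 2.0, "c": 1.0}   # made-up text-relevance scores
    genre = {"a": 0.0, "b": 1.0, "c": 0.5}       # made-up genre/formality scores
    print(merge_rankings(relevance, genre, weight=0.5))
```

With `weight=1.0` the order is that of the relevance scores alone, and with `weight=0.0` that of the genre scores alone; values in between trade one off against the other.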

Filed under: abstracts

Chapter Excerpt: Riding the Rough Waves of Genre on the Web

Riding the Rough Waves of Genre on the Web: Concepts and Research Questions, by Marina Santini, Alexander Mehler, Serge Sharoff In: Genres on the Web: Computational Models and Empirical Studies, Alexander Mehler, Serge Sharoff and Marina Santini, Text, Speech and Language Technology, Volume 42, 2011, DOI: 10.1007/978-90-481-9178-9 1 Why is Genre Important? Genre, in the most generic definition, takes the meaning “kind; sort; style” (OED). A more specialised definition of genre in OED reads: “A particular style or category of works of art; esp. a type of literary work characterised by a particular form, style, or purpose.” Similar definitions are found in other dictionaries, for instance, OALD reads “a particular type or style of literature, art, film or music that you can recognise because of its special features”. Broadly speaking, then, generalising from lexicographic definitions, genre can be seen … Read entire article »

Filed under: chapter excerpts