Articles Comments

Headline

Papageno: Predictive Models for Crisis Intelligence

Last Updated: Stockholm, 15 May 2013 Papageno: A Pilot Study to identify suitable Predictive Models for Crisis Intelligence I need some help to jot down real-world use cases for crisis intelligence.  Could you please point out to me past events or previous experiences that can be useful for a pilot study? ”Crisis intelligence” is a new research area that is becoming more and more crucial in medium-large organizations and companies. It consists in detecting an upcoming “crisis” (a scandal or general dissatisfaction or any negative attitude) by automatically analysing text documents of any kind in electronic format. Many commercial and open source solutions are proposed to identify the “mood” and the sentiment of masses with respect to a certain event, brand, or person through tweets, blogs, etc. But very little research has been carried … Read entire article »

Latest

Requests for proposal (RFP) and IR

Last Updated: 1st May 2013 I am looking for a list of functions and features buyers may use in their request for proposal (RFP) to help them acquire an enterprise search/IR platform. Any experience to share about this topic? Any reference that can help analyze this problem in depth? Thanks in advance.   Bookmark on Delicious Recommend on Facebook Share on Linkedin Tweet about it Subscribe to the comments on this post … Read entire article »

Towards a Cross-Lingual Lexical Knowledge Base of Lexical Forms

Last updated: 15 May 2013 How do you overcome problems related to cross-linguality? My specific problem at them moment is caused by the poor coverage of everyday language in lexical resources. For instance, the Swedish single-word expression /egenremiss/ (14,900 hits, Google.se April 2013) – or alternatively as a a multiword expession (MWE) – /egen remiss/ (8,210 hits, Google.se April 2013) denotes a referral to a specialist doctor written by patients themselves. This expression is made up from two common Swedish words /egen/ `own (adj)’ and /remiss/ `referral’. It is a recent expression (probably coined around 2010*) and not yet recorded in any official dictionary nor in Wiktionary or other multilingual online lexical resources. This compound happens … Read entire article »

Presentation: Text analytics and R – Open Question: is it a good match?

* The Quest: finding the optimal way to handle Big Textual Data for Information Discovery & Actionable Intelligence * The Open Question: is R convenient for text analytics of Big TEXTUAL Data? * The Mission: identification of pros, cons, limits, benefits Current Status: investigation in progress… Live casts of the this R meetup are available here: First talk: Text analytics and R by Marina Santini Second and third talks: Wordclouds from Twitter with R by Måns Magnusson and An example of text analytics in R by Joakim Lundborg http://www.meetup.com/StockholmR/pages/Live_casts_from_past_meetups/ You can find R code suggestions in this thread:  http://www.meetup.com/StockholmR/events/103353372/?&a=uc1_te Bookmark on Delicious Recommend on Facebook Share on Linkedin Tweet about it Subscribe to the comments on this post … Read entire article »

Presentation: How Emotional Are Users’ Needs? Emotion in Query Logs

According to recent IR research, searchers’ behaviour is not only limited to traditional informational, navigational and transactional needs. A novel hypothesis is that the seeking behaviour is driven by emotion. These experiments are part of SearchInFocus, a study centred on search. How Emotional Are Users’ Needs? Emotion in Query Logs from Marina Santini http://www.cyberemotions.eu/ Bookmark on Delicious Recommend on Facebook Share on Linkedin Tweet about it Subscribe to the comments on this post … Read entire article »

Book in Preparation: A Computational Theory of Digital Genre

Book in preparation: A Computational Theory of Digital Genre by Marina Santini The book lists, examines and develops the key concepts necessary to build a novel, intuitive and robust definition of digital genre for computational purposes. The newly proposed definition is the tenet of the computational theory underlying computational models for automatic digital genre classification. The book is divided into six parts, each one discussing exhaustively issues that have been neglected or considered to be too controvertial to find any theoretical or pragmatic agreement among scholars or researchers. The book provides not only theoretical foundations, but also a number of use cases, corpora/datasets, and computational models that readers can re-use for their own experiments to evaluate … Read entire article »

Thesis Review: Resolving Power of Search Keys

Heppin, Karin Friberg (2010). Resolving Power of Search Keys in MedEval a Swedish Medical Text Collection with User Groups: Doctors and Patients. PhD thesis, Gothenburg University, Sweden Thesis: http://www2.gslt.hum.gu.se/dissertations/friberg.pdf Errata: http://www2.gslt.hum.gu.se/dissertations/friberg.pdf Opponent Stefan Schulz; Defence Presentation: http://user.meduni-graz.at/stefan.schulz/presentations/2010_Gothenburg_Defence.pptx The thesis “Resolving Power of Search Keys in MedEval a Swedish Medical Text Collection with User Groups: Doctors and Patients” opens with crucial questions in Information Retrieval (IR). The general question is: 1. What type of search keys are effective when searching for information in a collection of documents? Language-specific questions refer to how to handle compounds, since around 10%2 of words in Swedish running texts are compounds Then, important questions are: 2. What is the best way to treat compounds? 3. When is it beneficial to … Read entire article »

Request: Corpus-Based Sublanguage Glossary

How to build a glossary of: specialized term = common word automatically? Dear all, I wonder if you have any experience or if you can provide references on how to build automatically  a glossary from genre-specific corpora. The glossary should be made of pairs in the form of: sublangage term = common/familiar word. For instance: anemi = blood deficiency analgesic = painkiller etc. Thanks in advance for suggestions and pointers. Marina   Bookmark on Delicious Recommend on Facebook Share on Linkedin Tweet about it Subscribe to the comments on this post … Read entire article »

Reflection: Analysing Emotions of Social Writing

by Marina Santini A few days ago, I attended a fascinating session organized by the Quantified Self Stockholm (QS) MeetuUp, in a venue with an inspiring name, Psykologifabriken (The Psychology Factory), in center Stockholm. This QS session – Adding Power to body and soul… – included two presentations: one about adding power to the body through a robotic glove that adds gripping energy to the hand of those who have lost strength in this limb; the other one about methods to enable self-development through digital tools. Since I am not into robotics, I will only say that the empowering glove shown by Johan Ingvast from Bioservo is simply amazing… I am not a psychologist either, but I found the presentation … Read entire article »

Question: How to Define Criteria for Subgenre Classification?

I had an interesting email exchange with Christophe Clugston, a researcher currently located in Thailand, about the classification of a specific subgenre belonging to the Netadvertising supergenre. He says: “I am looking at classifying a very narrow sub genre. Within the domain of Netvertising I am looking at an extant, variant genre that I am terming Long Scroll Web Advertisements (as the off line version is termed Long Copy Advertising). This type of advertising is very different than the multi media image tied to a few words or few clauses. It is based entirely on the factor of extended reading (some of these ads are over 24 pages when printed). I have enclosed a link to … Read entire article »

Follow

Get every new post on this blog delivered to your Inbox.

Join other followers: