Dear All, I am looking for corpora of any genre in the following languages: English, Swedish, Polish, Italian, Finnish, Estonian, and Hungarian. I am already aware of a number of corpora (several posts in this blog are dedicated to the dissemination…
Category: requests
Papageno: Predictive Models for Crisis Intelligence
Last Updated Comments: 22 July 2013 Papageno: A Pilot Study to identify suitable Predictive Models for Crisis Intelligence I need some help to jot down real-world use cases for crisis intelligence. Could you please point out to me past events…
Requests for proposal (RFP) and IR
Last Updated: 1st May 2013 I am looking for a list of functions and features buyers may use in their request for proposal (RFP) to help them acquire an enterprise search/IR platform. Any experience to share about this topic? Any reference…
Towards a Cross-Lingual Lexical Knowledge Base of Lexical Forms
Last updated: 15 May 2013 How do you overcome problems related to cross-linguality? My specific problem at the moment is caused by the poor coverage of everyday language in lexical resources. For instance, the Swedish single-word expression /egenremiss/ (14,900 hits,…
Request: Corpus-Based Sublanguage Glossary
How to build a glossary of: specialized term = common word automatically? Dear all, I wonder if you have any experience or if you can provide references on how to automatically build a glossary from genre-specific corpora. The glossary should…
Question: How to Define Criteria for Subgenre Classification?
I had an interesting email exchange with Christophe Clugston, a researcher currently located in Thailand, about the classification of a specific subgenre belonging to the Netadvertising supergenre. He says: “I am looking at classifying a very narrow sub genre. Within…
Actionable Corpus & Actionable Intelligence
I am trying to figure out how to predict future trends independently from entities. For example, instead of trying to guess who (Obama and Romney are two entities) will win the next American elections, I would like to predict the trend…
Request: Looking for Multi-Dimensional Social Network Datasets/Corpora/Collections
Is anyone aware of multi-dimensional social network datasets/corpora/collections where friendships are based on several attributes? For example, A is friends with B because they are co-authors. Or, A is friends with C because they play badminton. Generally, Facebook-based datasets describe…
Text/Content Analytics for Suicide Prevention (I)
An interesting topic has been brought to my attention almost simultaneously by two friends working in very different areas (namely by a linguist and a psychiatrist): the language of suicides. My mind has immediately converted their differing perspectives on the…
Applying Findability to Mine Query Logs for BI: Preliminaries
Marina Santini. Copyright © 2012 Thanks for sharing pointers and for giving hints to the question: “Can anyone suggest references about mining query logs for BI and CEM?” (http://www.forum.santini.se/2012/05/mining-query-logs-for-bi-and-cem/). Please feel free to add comments to the blog post, if…