At the Atlanta Semantic Web Meetup tonight, Vishy Dasari gave us a quick description and demo of a new search engine called Semantifi. They purportedly are a search engine for the deep web, meaning the web that is not indexed by traditional search engines because the content is dynamic. They are just in the very [...]
Posts Tagged ‘natural language processing’
Semantifi and the Deep Web
Posted: 6 February 2010 in UncategorizedTags: computational linguistics, natural language processing, search engines, search interfaces, semantic search engine, semantic web, wolfram alpha
NLP Resources for Ruby
Posted: 13 September 2009 in UncategorizedTags: computational linguistics, java, natural language processing, nlp, parsers, python, ruby, stemmers, wordnet
There are quite a few well-known libraries for doing various NLP tasks in Java and Python, such as the Stanford Parser (Java) and the Natural Language Toolkit (Python). For Ruby, there are a few resources out there, but they are usually derivative or not as mature. By derivative, I mean they are ports from other [...]
First Impressions of Wolfram|Alpha
Posted: 16 May 2009 in UncategorizedTags: computational linguistics, google, google squared, knowledge engines, natural language processing, search engines, stephen wolfram, wikipedia, wolfram alpha
Perhaps you’ve heard of the latest brainchild of the Wunderkind Stephen Wolfram: Wolfram|Alpha. Matthew Hurst nicknamed it Alphram today and I agree that’s a much better name. Wolfram|Alpha (W|A henceforth) is not a search engine, it’s a knowledge engine. It will compete with Google on a slice of traffic that Google really isn’t all that [...]
Which CS subfield is your strength?
Posted: 12 March 2008 in UncategorizedTags: artificial intelligence, computational linguistics, computer science, interviews, natural language processing, questions, stress
So I was recently asked (and gave a very bad answer to) a question that has been haunting me ever since. What is the subfield of computer science where I am the strongest? First of all, in my undergraduate training, I was never really introduced to these ideas of subfields of CS explicitly. I knew [...]
Who created OpenEphyra?
Posted: 16 February 2008 in UncategorizedTags: cmu, computational linguistics, information retrieval, lti, natural language processing, open source, openephyra, qa, question answering, software
OpenEphyra is a question answering (QA) system developed here at the Language Technologies Institute by Nico Schlaefer. He began his work at the University of Karlsruhe in Germany, but has since continued it at CMU and is currently a PhD student here. Since it is a home-grown language technologies package, I decided to check it [...]
Cognate Identification: Orthographic Methods
Posted: 26 January 2008 in UncategorizedTags: algorithms, cognate identification, cognates, computational linguistics, historical linguistics, language change, linguistics, machine translation, natural language processing, orthography, string matching
In previous posts on cognate identification, I discussed the difference between strict and loose cognates. Loose cognates are words in two languages that have the same or similar written forms. I also described how approaches to cognate identification tend to differ based on whether the data being used is plain text or phonetic transcriptions. The [...]
FSMNLP 2008 CFP
Posted: 19 January 2008 in UncategorizedTags: call for papers, cfp, computational linguistics, computational morphology, finite state methods, fsmnlp, italy, natural language processing
FSMNLP 2008 (Finite State Methods and Natural Language Processing) has issued their first Call for Papers (CFP). The deadline is May 11, 2008 and the conference will take place on September 11-12, 2008. Not the best time to be travelling perhaps, but this year it will be in Ispra, Lago Maggiore, Italy! That’s in the [...]
Cognate Identification: Approaches
Posted: 10 December 2007 in UncategorizedTags: cognate identification, computational linguistics, historical linguistics, language change, linguistics, natural language processing, phonology
In my previous post on cognate identification, I gave two definitions for cognates: strict and loose (orthographic). Strict cognates are words in two related languages that descended from the same word in the ancestor language. Loose cognates are words in two languages that are spelled or pronounced similarly (depending on the data consists of phonetic [...]
The Noob’s Guide to Parsing
Posted: 25 October 2007 in UncategorizedTags: computational linguistics, how-to, java, natural language processing, nlp, parsing, phpsyntaxtree, tutorial
So you want to automatically parse sentences without having to go through all the trouble of figuring it out for yourself? You’ve come to the right place. This brief tutorial is aimed at students who are interested in computer science and linguistics who maybe want to dip their feet in the water of computational linguistics [...]


