Tuesday, 4 March 2008

Building an NLP Robot to Live on my Forum

This is starting out really just as a discussion of the technologies involved in order to create a natural language processing agent that would be able to join in conversations (probably in a rather basic way) on my Forum. As I see it the stages required are as follows:

  • Read input
  • Use dictionary database to find word types (probably implementing fuzzy matching to deal with typos). Gcide looks like a good one
  • Build parse tree - verb phrases, noun phrases etc
  • Calculate input semantic category (e.g. question, statement)
  • Semantic processing
  • Judge output semantic category
  • Use semantic rules to create output 'theme'
  • Use syntactic parse tree templates to turn output into natural language
  • Write output
Now of course some of these stages are much more complicated than others