I recently received an email from a US undergraduate interested in tools and resources for NLP, particularly free tagged corpora. Luckily, the NLP field has matured into an open access friendly crowd, so there are lots of resources freely available. Maybe too many. To be honest, too many search result hits is a pain. Newbies aren't looking for ridiculously long lists of resources which they have to pick through exactly BECAUSE they're newbies! They don't know how to choose between them. And all too often expert/experienced NLPers will simply push their pet language or resources not because its appropriate for newbies, but because it's the pet of the expert.
So my unsolicited teachable moment #333256: give newbies/students recommendations that are appropriate for them, not appropriate for you.
For example, with all due respect, no newbie NLPer should go anywhere near the Stanford NLP Annotated List of Resources. I'm the first to admit that's a GREAT list of resources. No argument from me. But most of those resources requires at least basic familiarization with NLP before starting (most require more).
For true newbies, The Natural Language Toolkit remains my preferred option. Its excellent teaching book, tutorials, packaged corpora and data, and solid documentation make it the reigning king of NLP intro tools. Plus, it's a mature enough toolkit to be used for more extensive projects. Hard to go wrong.
FWIW, This post was not a paid endorsement of any kind. I have no professional or personal relationship with anyone involved in the NLTK. I follow several people involved with the project on Twitter. That's as close to a personal involvement as I get. This post is not meant as a commercial advertisement, but rather as my own personal opinion.
Showing posts with label NLTK. Show all posts
Showing posts with label NLTK. Show all posts
Sunday, January 27, 2013
Wednesday, September 26, 2007
Intro to CL Books ...
Bob Carpenter has blogged about a new Intro to IR book online here. I'm looking forward to skimming it this weekend. I would also recommend the Python based NLTK Toolkit.
Books and resources like these are generally geared towards people with existing programming background. If a linguist with no programming skills is interested in learning some computational linguistics, Mike Hammond has written a couple of novice's intro books called Programming For Linguists. A novice would be wise to start with Hammond's books, move to the NLTK tutorials, then move on to a more serious book like Manning et al.
And if you're at all curious about what a linguist might DO once she has worked through all that wonderful material, you might could go to my own most wonderful List of Companies That Hire Computational Linguists page here.
And if you're not challenged by any of that above, I dare you to read Bob's Type-Logical Semantics. Go on, you think yer all smart and such. I dare ya! I read it the summer of 1999 with a semanticist, a logician, and a computer scientist and it made all of our heads hurt. I still have Chapter 10 nightmares.
Books and resources like these are generally geared towards people with existing programming background. If a linguist with no programming skills is interested in learning some computational linguistics, Mike Hammond has written a couple of novice's intro books called Programming For Linguists. A novice would be wise to start with Hammond's books, move to the NLTK tutorials, then move on to a more serious book like Manning et al.
And if you're at all curious about what a linguist might DO once she has worked through all that wonderful material, you might could go to my own most wonderful List of Companies That Hire Computational Linguists page here.
And if you're not challenged by any of that above, I dare you to read Bob's Type-Logical Semantics. Go on, you think yer all smart and such. I dare ya! I read it the summer of 1999 with a semanticist, a logician, and a computer scientist and it made all of our heads hurt. I still have Chapter 10 nightmares.
Subscribe to:
Comments (Atom)
TV Linguistics - Pronouncify.com and the fictional Princeton Linguistics department
[reposted from 11/20/10] I spent Thursday night on a plane so I missed 30 Rock and the most linguistics oriented sit-com episode since ...
-
Matt Damon's latest hit movie Elysium has a few linguistic oddities worth pointing out. The film takes place in a dystopian future set i...
-
Bob Carpenter recently made the following comment on one of my posts: I'm very excited to hear that linguists are beginning to take sta...
-
[reposted from 11/20/10] I spent Thursday night on a plane so I missed 30 Rock and the most linguistics oriented sit-com episode since ...