Training an Entity Linker Model with spaCy
Entity linking is the NLP task that maps textual mentions to unique identifiers in external knowledge bases. This chapter explores how to train an entity linking model using spaCy and the best practices on how to create good datasets for NLP training. We will also learn how to use a custom corpus reader to train a spaCy component. With this knowledge, you can customize any of the spaCy components to use while training your models.
We will cover the following in this chapter:
- Understanding the concept and importance of entity linking in NLP
- Best practices for creating high-quality datasets for NLP training
- Training an
EntityLinkercomponent with spaCy - Utilizing a custom corpus reader to train a spaCy component
By the end of this chapter, you will be able to develop NLP models that integrate with external knowledge bases, enhancing accuracy and applicability in real-world scenarios.