You're reading from Machine Learning Techniques for Text Apply modern techniques with Python for text processing, dimensionality reduction, classification, and evaluation

Product type Paperback

Published in Oct 2022

Publisher Packt

ISBN-13 9781803242385

Length 448 pages

Edition 1st Edition

Languages

Python

Concepts

Machine Learning

Author (1):

Nikos Tsourakis

View More author details

Table of Contents (13) Chapters

Preface

1. Chapter 1: Introducing Machine Learning for Text

2. Chapter 2: Detecting Spam Emails FREE CHAPTER

3. Chapter 3: Classifying Topics of Newsgroup Posts

4. Chapter 4: Extracting Sentiments from Product Reviews

5. Chapter 5: Recommending Music Titles

6. Chapter 6: Teaching Machines to Translate

7. Chapter 7: Summarizing Wikipedia Articles

8. Chapter 8: Detecting Hateful and Offensive Language

9. Chapter 9: Generating Text in Chatbots

10. Chapter 10: Clustering Speech-to-Text Transcriptions

11. Index

Why subscribe?

12. Other Books You May Enjoy

Performing extractive summarization

In the chapter’s introduction, we mentioned that extractive summarization identifies important words or phrases and stitches them together to produce a condensed version of the original text. In this section, we use the previously created books.json file and employ different methods to extract summaries for an input document. Due to space limitations and the need to focus on state-of-the-art techniques, we do not present the theory behind the methods. However, there is a plethora of online resources that can be consulted. A good starting point is the following link: https://miso-belica.github.io/sumy/summarizators.html.

Let’s begin by loading the data from the file and printing a few examples:

import pandas as pd
df = pd.read_json('books.json')
df.head()
>>  title                    product_description...

The rest of the chapter is locked

Tech Concepts

Programming languages

Tech Tools

Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

50+ new titles added per month and exclusive early access to books as they are being written.

You're reading from Machine Learning Techniques for Text Apply modern techniques with Python for text processing, dimensionality reduction, classification, and evaluation

Table of Contents (13) Chapters

Performing extractive summarization

Authors (1)

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access