Skip to content

How do I get the term frequency and the number of co-occurrences of two terms? #48

@clark874

Description

@clark874

Hi John, another question for you:)

In my study, I tried to use one-mode graphs of terms to demonstrate that some terms are more important than others. I processed the graphml file exported by textnets through networkx. I can get the degree of the node and the value of the weight of the edge. According to the documentation, the latter is obtained by tf-idf calculation and is a floating point value.

My question is: Is there a way to get the term frequency of a term in all the documents and the absolute value of the number of co-occurrences of two terms in all the corpus? Both values should be an integer, is it possible to add these value as a COUNT attribute to the node and edge attributes?

Thanks again for your work!

Metadata

Metadata

Assignees

Labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions