Follow

Follow

Contact

Home Frequency Distribution of Bigrams

Questions

Frequency Distribution of Bigrams

byMR

December 13, 2021

I have done the following

import nltk


words = nltk.corpus.brown.words()
freq = nltk.FreqDist(words)

And am able to find the frequency of certain words in the brown corpus, like

freq["the"]
62713

But now I want to be able to find the Frequency Distribution of specific bigrams. So then I tried

bigrams = nltk.bigrams(words)
freqbig = nltk.FreqDist(bigrams)

But every bigram that I enter, I always get 0. Like,

freqbig["the man"]
0

What I am doing wrong?

>Solution :

It accepts a tuple as key, not a str:

freqbig[("the", "man")]

OUTPUT

You could create an auxiliary function which takes care of it if you want to pass strings:

def get_frequency(my_string):
    return freqbig[tuple(my_string.split(" "))]

frequency-distribution

byMR

Published December 13, 2021

Add a comment

Leave a ReplyCancel reply

Read more

Questions

How to use pipe from lodash/fp in HTML?

byMR

December 13, 2021

Questions

Dictionary leaf generator

byMR

December 13, 2021

Questions

"Bean type int could not be found" error, how can I solve it?

byMR

December 13, 2021

Questions

What am I doing wrong while trying to sort this array?

byMR

December 13, 2021

Questions

How does std::unique_ptr handle raw pointers/references in a class?

byMR

December 13, 2021

Questions

crc-16 IBM, 0x00 not taken in consideration

byMR

December 13, 2021