5000 Most Common English Words List

photo author
- Senin, 3 April 2023 | 19:48 WIB
Nonton A Serbian Film sub indo (Facebook @Cine de Culto)
Nonton A Serbian Film sub indo (Facebook @Cine de Culto)

5000 Most Common English Words List

# Calculate word frequencies word_freqs = Counter(tokens)

import nltk from nltk.corpus import brown from nltk.tokenize import word_tokenize from collections import Counter 5000 most common english words list

Do you have any specific requirements or applications in mind for this list? 'w') as f: for word

# Get the top 5000 most common words top_5000 = word_freqs.most_common(5000) 5000 most common english words list

# Save the list to a file with open('top_5000_words.txt', 'w') as f: for word, freq in top_5000: f.write(f'{word}\t{freq}\n') Keep in mind that the resulting list might not be perfect, as it depends on the corpus used and the preprocessing steps.

# Tokenize the text and remove stopwords stopwords = nltk.corpus.stopwords.words('english') tokens = [word.lower() for word in brown.words() if word.isalpha() and word.lower() not in stopwords]

# Download the Brown Corpus if not already downloaded nltk.download('brown')

Halaman:
Dilarang mengambil dan/atau menayangkan ulang sebagian atau keseluruhan artikel
di atas untuk konten akun media sosial komersil tanpa seizin redaksi.

Editor: Edy Mufti Es

Sumber: Berbagai sumber

Tags

Artikel Terkait

Rekomendasi

Terkini

X