"വിക്കിനിഘണ്ടു:ഏറ്റവും സാധാരണമായ വാക്കുകളുടെ പട്ടിക" എന്ന താളിന്റെ പതിപ്പുകൾ തമ്മിലുള്ള വ്യത്യാസം

Jacob.jose (സന്ദേശങ്ങള്‍) ചെയ്ത തിരുത്തല്‍ 19192 നീക്കം ചെയ്യുന്നു
വരി 10:
Here are the top 100 words (from tv scripts) in alphabetical order:
 
:[[aabout]] · [[aboutall]] · [[alland]] · [[andare]] · [[areas]] · [[asat]] · [[ata]] · [[back]] · [[bebecause]] · [[becausebeen]] · [[beenbe]] · [[but]] · [[can't]] · [[can't]] · [[come]] · [[could]] · [[diddidn't]] · [[didn'tdid]] · [[dodon't]] · [[don'tdo]] · [[for]] · [[from]] · [[get]] · [[gogoing]] · [[goinggood]] · [[goodgot]] · [[gotgo]] · [[had]] · [[have]] · [[he's]] · [[herhere]] · [[hereher]] · [[he'shey]] · [[heyhe]] · [[him]] · [[his]] · [[how]] · [[I'll]] · [[ifI'm]] · [[I'llif]] · [[I'min]] · [[inis]] · [[isit's]] · [[it]] · [[it'sI]] · [[just]] · [[know]] · [[like]] · [[look]] · [[memean]] · [[meanme]] · [[my]] · [[nonot]] · [[notnow]] · [[nowno]] · [[of]] · [[oh]] · [[OKokay]] · [[okayOK|ok]]  · [[onone]] · [[oneon]] · [[or]] · [[out]] · [[really]] · [[right]] · [[say]] · [[see]] · [[she]] · [[sosomething]] · [[some]] · [[somethingso]] · [[tell]] · [[that's]] · [[that's]] · [[thethen]] · [[thenthere]] · [[therethey]] · [[theythe]] · [[think]] · [[this]] · [[time]] · [[to]] · [[up]] · [[want]] · [[was]] · [[wewell]] · [[wellwere]] · [[werewe]] · [[what]] · [[when]] · [[who]] · [[why]] · [[will]] · [[with]] · [[would]] · [[yeah]] · [[yes]] · [[you're]] · [[your]] · [[you're]]
 
Here they are in frequency order:
വരി 22:
:[[Wiktionary:Frequency lists/TV/2006/40001-41284|40001-41284]] (the dregs that were tied for 40,000th place)
 
That'll probably be it. It's a third of all the unique words. The rest were used 5five or fewer times each.
 
===ഏറ്റവും സാധാരണയായ വാക്കുകള്‍ (ഗുട്ടന്‍ബര്‍ഗ്)===
These lists are the most frequent words, when performing a simple, straight (obvious) frequency count of all the books found on [[wikipedia:Project Gutenberg|Project Gutenberg]]. The list of books was downloaded in July of 2005, and "[[w:rsync|rsync]]"'ed monthly thereafter. These are mostly English words, with some other languages finding representation to a lesser extent. ManyNote that many Project Gutenberg books are scanned once their copyright expires, typically book editions published [[w:Template:PD-US|before 1923]], so the language doesmay not exactly represent modern usage. For example,Note "hath"also is listed as the 534th-most-common word. Also,that with 24,000+ books, the text of the boilerplate warning for Project Gutenberg appears on each of them.
 
Here are the top 100 words (from Project Gutenberg texts) in alphabetical order:
:[[aabout]] · [[aboutafter]] · [[afterall]] · [[alland]] · [[andany]] · [[anyan]] · [[anare]] · [[areas]] · [[asat]] · [[ata]] · [[been]] · [[before]] · [[be]] · [[but]] · [[by]] · [[can]] · [[could]] · [[did]] · [[down]] · [[do]] · [[first]] · [[for]] · [[from]] · [[good]] · [[great]] · [[had]] · [[has]] · [[have]] · [[her]] · [[he]] · [[him]] · [[his]] · [[if]] · [[into]] · [[in]] · [[is]] · [[its]] · [[it]] · [[I]] · [[know]] · [[like]] · [[little]] · [[made]] · [[man]] · [[may]] · [[men]] · [[me]] · [[more]] · [[Mr]] · [[much]] · [[must]] · [[my]] · [[not]] · [[now]] · [[no]] · [[of]] · [[onone]] · [[oneonly]] · [[onlyon]] · [[or]] · [[other]] · [[our]] · [[out]] · [[over]] · [[said]] · [[see]] · [[she]] · [[should]] · [[some]] · [[so]] · [[such]] · [[than]] · [[that]] · [[thetheir]] · [[theirthem]] · [[themthen]] · [[thenthere]] · [[therethese]] · [[thesethey]] · [[theythe]] · [[this]] · [[time]] · [[to]] · [[two]] · [[upon]] · [[up]] · [[us]] · [[very]] · [[was]] · [[were]] · [[we]] · [[what]] · [[when]] · [[which]] · [[who]] · [[will]] · [[with]] · [[would]] · [[youyour]] · [[youryou]]
 
*These [[wikified]] terms can be copied to other language wiktionaries,... this is what they are intended for. If you do, please add an [[interwiki]] link onto the page here.
 
So far:
[[fi:Wikisanakirja:Frequency lists/PG/2006/04/1-10000]]
[[:fi:Wikisanakirja:Frequency lists/PG/2006/04/1-10000|Finnish]] -
[[fr:Wiktionnaire:Listes de fréquence]]
[[:fr:Wiktionnaire:Listes de fréquence|French]] -
[[la:Victionarium:Dictiones_in_Project_Gutenberg_per_frequentiam|Latin]]
[[:la:Victionarium:Dictiones_in_Project_Gutenberg_per_frequentiam|Latin]] -
[[ta:விக்சனரி:frequency lists|Tamil]]
[[:ta:விக்சனரி:frequency lists|Tamil]]
[[es:Wikcionario:Palabras más frecuentes del español]]
 
:New list as of 4/16/2006:
Line 59 ⟶ 70:
*[[Wiktionary:Frequency lists/Project Gutenberg 90001-100000]]
 
::ApproximatelyAppoximately ''24,197 files, 1,712,082,956 words, 70,756.0 average words/file.'' from which were gleaned about 9,053,310 unique "words."
 
*From the straight frequency count, the current copy of Wiktionary was then removed from that list. Even entries that only have a redirect were removed.
Line 72 ⟶ 83:
 
: [[User:Connel MacKenzie/Gutenberg]]
 
===Most common words in contemporary fiction===
 
The 2,000 most common words in contemporary fiction can be found here:
*[[Wiktionary:Frequency lists/Contemporary fiction]].
 
The 2,000 most common words in contemporary fiction can be found here divided into 60 subject categories.
*[[Wiktionary:Frequency lists/Contemporary fiction in 60 categories]].
 
This lumps regular lemmas of the same word together, unlike most of these lists.
 
===Top English words lists===