"വിക്കിനിഘണ്ടു:ഏറ്റവും സാധാരണമായ വാക്കുകളുടെ പട്ടിക" എന്ന താളിന്റെ പതിപ്പുകൾ തമ്മിലുള്ള വ്യത്യാസം

Jacob.jose (സന്ദേശങ്ങള്‍) ചെയ്ത തിരുത്തല്‍ 19192 നീക്കം ചെയ്യുന്നു
വരി 10:
Here are the top 100 words (from tv scripts) in alphabetical order:
 
:[[abouta]] · [[allabout]] · [[andall]] · [[areand]] · [[asare]] · [[atas]] · [[aat]] · [[back]] · [[becausebe]] · [[beenbecause]] · [[bebeen]] · [[but]] · [[can't]] · [[can't]] · [[come]] · [[could]] · [[didn'tdid]] · [[diddidn't]] · [[don'tdo]] · [[dodon't]] · [[for]] · [[from]] · [[get]] · [[goinggo]] · [[goodgoing]] · [[gotgood]] · [[gogot]] · [[had]] · [[have]] · [[he's]] · [[hereher]] · [[herhere]] · [[heyhe's]] · [[hehey]] · [[him]] · [[his]] · [[how]] · [[I'll]] · [[I'mif]] · [[ifI'll]] · [[inI'm]] · [[isin]] · [[it'sis]] · [[it]] · [[Iit's]] · [[just]] · [[know]] · [[like]] · [[look]] · [[meanme]] · [[memean]] · [[my]] · [[notno]] · [[nownot]] · [[nonow]] · [[of]] · [[oh]] · [[okayOK]] · [[OK|okokay]]  · [[oneon]] · [[onone]] · [[or]] · [[out]] · [[really]] · [[right]] · [[say]] · [[see]] · [[she]] · [[somethingso]] · [[some]] · [[sosomething]] · [[tell]] · [[that's]] · [[that's]] · [[thenthe]] · [[therethen]] · [[theythere]] · [[thethey]] · [[think]] · [[this]] · [[time]] · [[to]] · [[up]] · [[want]] · [[was]] · [[wellwe]] · [[werewell]] · [[wewere]] · [[what]] · [[when]] · [[who]] · [[why]] · [[will]] · [[with]] · [[would]] · [[yeah]] · [[yes]] · [[you're]] · [[your]] · [[you're]]
 
Here they are in frequency order:
വരി 22:
:[[Wiktionary:Frequency lists/TV/2006/40001-41284|40001-41284]] (the dregs that were tied for 40,000th place)
 
That'll probably be it. It's a third of all the unique words. The rest were used five5 or fewer times each.
 
===ഏറ്റവും സാധാരണയായ വാക്കുകള്‍ (ഗുട്ടന്‍ബര്‍ഗ്)===
These lists are the most frequent words, when performing a simple, straight (obvious) frequency count of all the books found on [[wikipedia:Project Gutenberg|Project Gutenberg]]. The list of books was downloaded in July of 2005, and "[[w:rsync|rsync]]"'ed monthly thereafter. These are mostly English words, with some other languages finding representation to a lesser extent. Note that manyMany Project Gutenberg books are scanned once their copyright expires, typically book editions published [[w:Template:PD-US|before 1923]], so the language maydoes not exactly represent modern usage. For Noteexample, also"hath" thatis listed as the 534th-most-common word. Also, with 24,000+ books, the text of the boilerplate warning for Project Gutenberg appears on each of them.
 
Here are the top 100 words (from Project Gutenberg texts) in alphabetical order:
:[[abouta]] · [[afterabout]] · [[allafter]] · [[andall]] · [[anyand]] · [[anany]] · [[arean]] · [[asare]] · [[atas]] · [[aat]] · [[been]] · [[before]] · [[be]] · [[but]] · [[by]] · [[can]] · [[could]] · [[did]] · [[down]] · [[do]] · [[first]] · [[for]] · [[from]] · [[good]] · [[great]] · [[had]] · [[has]] · [[have]] · [[her]] · [[he]] · [[him]] · [[his]] · [[if]] · [[into]] · [[in]] · [[is]] · [[its]] · [[it]] · [[I]] · [[know]] · [[like]] · [[little]] · [[made]] · [[man]] · [[may]] · [[men]] · [[me]] · [[more]] · [[Mr]] · [[much]] · [[must]] · [[my]] · [[not]] · [[now]] · [[no]] · [[of]] · [[oneon]] · [[onlyone]] · [[ononly]] · [[or]] · [[other]] · [[our]] · [[out]] · [[over]] · [[said]] · [[see]] · [[she]] · [[should]] · [[some]] · [[so]] · [[such]] · [[than]] · [[that]] · [[theirthe]] · [[themtheir]] · [[thenthem]] · [[therethen]] · [[thesethere]] · [[theythese]] · [[thethey]] · [[this]] · [[time]] · [[to]] · [[two]] · [[upon]] · [[up]] · [[us]] · [[very]] · [[was]] · [[were]] · [[we]] · [[what]] · [[when]] · [[which]] · [[who]] · [[will]] · [[with]] · [[would]] · [[youryou]] · [[youyour]]
 
*These [[wikified]] terms can be copied to other language wiktionaries..., this is what they are intended for. If you do, please add an [[interwiki]] link onto the page here.
 
So far:
[[fi:Wikisanakirja:Frequency lists/PG/2006/04/1-10000]]
[[:fi:Wikisanakirja:Frequency lists/PG/2006/04/1-10000|Finnish]] -
[[fr:Wiktionnaire:Listes de fréquence]]
[[:fr:Wiktionnaire:Listes de fréquence|French]] -
[[la:Victionarium:Dictiones_in_Project_Gutenberg_per_frequentiam|Latin]]
[[:la:Victionarium:Dictiones_in_Project_Gutenberg_per_frequentiam|Latin]] -
[[ta:விக்சனரி:frequency lists|Tamil]]
[[:ta:விக்சனரி:frequency lists|Tamil]]
[[es:Wikcionario:Palabras más frecuentes del español]]
 
:New list as of 4/16/2006:
Line 70 ⟶ 59:
*[[Wiktionary:Frequency lists/Project Gutenberg 90001-100000]]
 
::AppoximatelyApproximately ''24,197 files, 1,712,082,956 words, 70,756.0 average words/file.'' from which were gleaned about 9,053,310 unique "words."
 
*From the straight frequency count, the current copy of Wiktionary was then removed from that list. Even entries that only have a redirect were removed.
Line 83 ⟶ 72:
 
: [[User:Connel MacKenzie/Gutenberg]]
 
===Most common words in contemporary fiction===
 
The 2,000 most common words in contemporary fiction can be found here:
*[[Wiktionary:Frequency lists/Contemporary fiction]].
 
The 2,000 most common words in contemporary fiction can be found here divided into 60 subject categories.
*[[Wiktionary:Frequency lists/Contemporary fiction in 60 categories]].
 
This lumps regular lemmas of the same word together, unlike most of these lists.
 
===Top English words lists===