A Study of the Statistics of Letters in English Words
摘要:
Data which had previously been published by several authors ( Ohlman, 1958; Pratt, 1942; Ohaver, 1933; Smith, 1943; Cox, 1947; Griffith, 1949; Gaines, 1956; Luhn, 1958) to describe the statistical characteristics of English words was examined to show the extent of their agreement. In addition, a detailed empirical study was made of two special types of English word: subject words and proper names. The data for the subject words and proper names was compared with previously reported data on subject words, proper names, and continuous text material. The statistical parameters which were measured and compared are: the distribution of initial letters, the distribution of terminal letters, the composite or total distribution of letters, the distribution of characters for each letter position, the distribution of bigrams, and the distribution of word lengths.
展开
DOI:
10.1016/S0019-9958(61)80036-3
被引量:
年份:
1961
相似文献
参考文献
引证文献
辅助模式
引用
文献可以批量引用啦~
欢迎点我试用!