Author
H. Brandt Corstius
Abstract
Discussion of the possibilities and limitations of Language Statistics. Sampling of “normal” prose is difficult as the “universe of written Dutch” is undefined. What predictions can be made about the contents of tomorrow's newspaper? The letter frequencies are very stable. Frequencies of bigrams and trigrams of letters bring us towards the word level about which something may be predicted. The sentence level can only be dealt with after a mechanical sentence analysis method is available. The highest level is still outside the scientific domain and belongs to the literary critic. Literary Statistics should be founded on a solid knowledge of Language Statistics. It is considered inappropriate to begin this field with difficult historical problems. The mutual distrust between linguist and statistician requires much tact on both sides. Eventually Language Statistics may become one of the bridges between the “two cultures”. The modern computer is an indispensable tool both for practical reasons (the giant mass of material) and for theoretical ones (the need to give unambiguous definitions of concepts like “sentence”, “word” and “syllable”). The use of Language Statistics in Mechanical Translation research is discussed. Review of the activities in the Netherlands. A one million word count is on its way. The speaker pleads for a National Center for Lexicology and Language Statistics.
Suggested Citation
H. Brandt Corstius, 1964.
"Taalstatistiek,"
Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 18(4), pages 353-367, December.
Handle:
RePEc:bla:stanee:v:18:y:1964:i:4:p:353-367
DOI: 10.1111/j.1467-9574.1964.tb00523.x