[1] N. Bernstein-Ratner. The phonology in parent child speech. In K. Nelson and A. van Kleeck, editors, Children's Language, volume 6. Erlbaum, Hillsdale, NJ, 1987. | |
[2] M. R. Brent. An e#cient, probabilistically sound algorithm for segmentation and word discovery. Machine Learning, 34:71--106, 1999. | |
[3] M. R. Brent and T. A. Cartwright. Distributional regularity and phonological constraints are useful for segmentation. Cognition, 61:93--125, 1996. | |
[4] G. Chaitin. On the length of programs for computing finite binary sequences. J. Assoc. Comput. Math., 13:547--569, 1966. Chunyu Kit 35 | |
[5] N. Chomsky. Syntactic Structure. Mouton, Hague, 1957. | |
[6] N. Chomsky. The Minimalist Program. MIT Press, Cambridge, MA., 1995. | |
[7] T. M. Cover and J. A. Thomas. Elements of Information Theory. John Wiley and Sons, Inc., New York, 1991. | |
[8] C. de Marcken. The unsupervised acquisition of a lexicon from continuous speech. Technical Report A.I. Memo No. 1558, AI Lab., MIT, Cambridge, Massachusetts, November 1995. | |
[9] C. de Marcken. Unsupervised Language Acquisition. PhD thesis, MIT, Cambridge, Massachusetts, 1996. | |
[10] C. Kit. Unsupervised Lexical Learning as Inductive Inference. PhD thesis, University of She#eld, UK, 2000. | |
[11] C. Kit and Y. Wilks. The Virtual Corpus approach to deriving n-gram statistics from large scale corpora. In C. Huang, editor, Proceedings of 1998 International Conference on Chinese Information Processing, pages 223--229, Beijing, November 1998. | |
[12] C. Kit and Y. Wilks. Unsupervised learning of word boundary with description length gain. In M. Osborne and E. T. K. Sang, editors, CoNLL-99, pages 1--6, Bergen, June 1999. | |
[13] A. N. Kolmogorov. Three approaches for defining the concept of ``information quantity''. Problem of Information Transmission, 1:4--7, 1965. | |
[14] M. Li and P. M. B. Vit’anyi. Introduction to Kolmogorov Complexity and its Applications. SpringerVerlag, New York, 1993. Second edition, 1997. | |
[15] B. MacWhinney. The CHILDES Database. Discovery Systems, Dublin, OH, 1991. | |
[16] B. MacWhinney and C. Snow. The child language data exchange system. Journal of Child Language, 12:171--296, 1985. | |
[17] U. Manber and E. Myers. Su#x array: a new method for on-line string searches. In First ASM-SIAM Symposium on Discrete Algorithms, pages 319--327, Providence, 1990. American Mathematical Society. | |
[18] D. C. Olivier. Stochastic Grammars and Language Acquisition Mechanisms. PhD thesis, Harvard University, Cambridge, MA, 1968. | |
[19] A. Peters. The Units of Language Acquisition. Cambridge University Press, Cambridge, England, 1983. | |
[20] J. Rissanen. Modelling by shortest data description. Automatica, 14:465--471, 1978. | |
[21] J. Rissanen. Stochastic Complexity in Statistical Inquiry. World Scientific, N.J., 1989. | |
[22] J. Rissanen and E. S. Ristad. Language acquisition in the MDL framework. In E. Ristad, editor, Language Computations. American Mathematical Society, Philadelphia, PA, 1994. | |
[23] C. Shannon. A mathematical theory of communication. Bell System Technical Journal, 27:379--423, 623--656, 1948. | |
[24] R. J. Solomono#. A formal theory of inductive inference, part 1 and 2. Information Control, 7:1--22, 224--256, 1964. | |
[25] C. J. van Rijsbergen. Information Retrieval. Butterworths, London, 2nd edition, 1979. | |
[26] A. Venkataraman. A statistical model for word discovery in transcribed speech. Computational Linguistics, 27(3):352--372, 2001. | |
[27] P. M. B. Vit’anyi and M. Li. On prediction by data compression. In Proc. 9th European Conference on Machine Learning, pages 14--30, Heidelberg, 1997. Springer-Verlag. Lecture Notes in Artificial Intelligence, Vol. 1224. | |
[28] P. M. B. Vit’anyi and M. Li. Minimum description length induction, Bayesianism, and Kolmogorov complexity. IEEE Transactions On Information Theory, IT-46(2):446--464, 2000. 36 Unsupervised Lexical Learning as Inductive Inference via Compression | |
[29] C. S. Wallace and D. M. Boulton. An information measure for classification. The Computer Journal, 11:185--195, 1968. | |
[30] C. S. Wallace and P. R. Freeman. Estimation and inference by compact coding. Journal of the Royal Statistical Society, 49:240--251, 1987. Discussion pages 251-265. | |
[31] G. K. Zipf. Human Behaviour and the Principle of Least E#ort. Hafner, New York, 1949. |
| HOME :: Back to the Paper :: References | Comments to: junwang4 you-know-at gmail.com | Last update: 11/16/07 |