Roy, D. (2001). Learning visually grounded words and syntax of natural spoken language. Evolution of Communication, 4(1), 33-56.

References (may not be complete)

Agre, P. (1988). The dynamic structure of everyday life (Tech. Rep. No. 1085). MIT Artificial Intelligence Laboratory.

Aslin, R., Woodward, J., LaMendola, N., & Bever, T. (1996). Models of word segmentation in fluent maternal speech to infants. In J. L. Morgan & K. Demuth (Eds.), Signal to syntax (pp. 117-134). Mahwah, NJ: Erlbaum.

Bailey, D., Feldman, J., Narayanan, S., & Lakoff, G. (1997). Embodied lexical development. In Proceedings of the nineteenth annual meeting of the Cognitive Science Society. Mahwah, NJ: Erlbaum.

Barsalou, L. (1999). Perceptual symbol systems. Behavioral and Brain Sciences, 22, 577-609.

Bloom, P. (2000). How children learn the meanings of words. Cambridge, MA: MIT Press.

Brent, M. (1999). An efficient, probabilistically sound algorithm for segmentation and word discovery. Machine Learning, 34, 71-106.

Brooks, R. (1986). A robust layered control system for a mobile robot. IEEE Journal of Robotics and Automation, 2(1), 14-23.

Cangelosi, A., & Harnad, S. (2002). The adaptive advantage of symbolic theft over sensorimotor toil: Grounding language in perceptual categories. Evolution of Communication.

Cover, T., & Thomas, J. A. (1991). Elements of information theory. New York, NY: Wiley-Interscience.

de Marcken, C. (1996). Unsupervised language acquisition. Unpublished doctoral dissertation, Massachusetts Institute of Technology, Cambridge, MA.

Deacon, T. (1997). The symbolic species: The co-evolution of language and the brain. Norton.

Garofolo, J. (1988). Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database. Gaithersburg, MD: National Institute of Standards and Technology (NIST).

Harnad, S. (1990). The symbol grounding problem. Physica D, 42, 335-346.

Huttenlocher, J., & Smiley, P. (1994). Early word meanings: The case of object names. In P. Bloom (Ed.), Language acquisition: Core readings (pp. 222-247). Cambridge, MA: MIT Press.

Jackendoff, R. (1983). Semantics and cognition. Cambridge, MA: MIT Press.

Johnson, M. (1987). The body in the mind. Chicago: University of Chicago Press.

Kuhl, P., Williams, K., Lacerda, F., Stevens, K., & Lindblom, B. (1992). Linguistic experience alters phonetic perception in infants by 6 months of age. Science, 255, 606-608.

Lakoff, G. (1987). Women, fire, and dangerous things. Chicago, IL: The University of Chicago Press.

Newell, A., & Simon, H. (1976). Computer science as empirical inquiry: Symbols and search. Communications of the ACM, 19, 113-126.

Peirce, C. (1932). Division of signs. In C. Hartshorne & P. Weiss (Eds.), Collected papers of Charles Sanders Peirce (Vol. II). Cambridge, MA: Harvard University Press.

Rabiner, L. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2), 257-285.

Regier, T. (1996). The human semantic potential. Cambridge, MA: MIT Press.

Robinson, T. (1994). An application of recurrent nets to phone probability estimation. IEEE Transactions on Neural Networks, 5(3).

Roy, D. (1999). Learning words from sights and sounds: A computational model. Unpublished doctoral dissertation, Massachusetts Institute of Technology.

Roy, D. (2000). Integration of speech and vision using mutual information. In Proc. of ICASSP. Istanbul, Turkey.

Roy, D. (In press). Grounded spoken language acquisition: Experiments in word learning. IEEE Transactions on Multimedia.

Roy, D. (In review). Learning to generate visually grounded spoken language. Computer Speech and Language.

Roy, D., & Pentland, A. (2002). Learning words from sights and sounds: A computational model. Cognitive Science, 26(1), 113-146.

Sankar, A., & Gorin, A. (1993). Adaptive language acquisition in a multi-sensory device. In Artificial neural networks for speech and vision (pp. 324-356). London: Chapman and Hall.

Schiele, B., & Crowley, J. (1996). Probabilistic object recognition using multidimensional receptive field histograms. In ICPR '96: Proceedings of the 13th International Conference on Pattern Recognition, Volume B (pp. 50-54).

Searle, J. (1980). Minds, brains, and programs. The Behavioral and Brain Sciences, 3.

Siskind, J. (1992). Naive physics, event perception, lexical semantics, and language acquisition. Unpublished doctoral dissertation, Massachusetts Institute of Technology.

Siskind, J. (2001). Grounding the Lexical Semantics of Verbs in Visual Perception using Force Dynamics and Event Logic. Artificial Intelligence Review, 15, 31-90.

Steels, L., & Kaplan, F. (2002). Aibo's first words: The social learning of language and meaning. Evolution of Communication.

Steels, L., & Vogt, P. (1997). Grounding adaptive language games in robotic agents. In C. Husbands & I. Harvey (Eds.), Proceedings of the 4th European Conference on Artificial Life. Cambridge, MA: MIT Press.

Comments to: junwang4 you-know-at gmail.com. Last update: 11/16/07