Speech and Language Processing (2nd Edition)
An explosion of Web-based language techniques, merging of distinct fields, availability of phone-based dialogue systems, and much more make this an exciting time in speech and language processing. The first of its kind to thoroughly cover language technology – at all levels and with all modern technologies – this book takes an empirical approach to the subject, based on applying statistical and other machine-learning algorithms to large corpora.
Introduction to Information Retrieval
Class-tested and coherent, this groundbreaking new textbook teaches web-era information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. Written from a computer science perspective by three leading experts in the field, it gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections.
Natural Language Understanding (2nd Edition)
In addition, this title offers coverage of two entirely new subject areas. First, the text features a new chapter on statistically-based methods using large corpora.
CLARION Project Home Page
CLARION is a project investigating fundamental structures of the human mind. In particular, it aims to explore the interaction of implicit and explicit cognition, emphasizing bottom-up learning (i.e., learning that involves first acquiring implicit knowledge and then acquiring explicit knowledge on its basis). The project is aimed at the synthesis of many interesting intellectual ideas into a coherent model of cognition.
SRILM - The SRI Language Modeling Toolkit
SRILM is a toolkit for building and applying statistical language models (LMs), primarily for use in speech recognition, statistical tagging and segmentation, and machine translation. It has been under development in the SRI Speech Technology and Research Laboratory since 1995. The toolkit has also greatly benefitted from its use and enhancements during the Johns Hopkins University/CLSP summer workshops in 1995, 1996, 1997, and 2002 (see history).
Linguistic Data Consortium
The Linguistic Data Consortium supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards.
Engkoo
Microsoft researchers are using data mined from the Internet to develop Engkoo, an online Chinese-to-English dictionary and language-practice service. The technology could be used in similar tools to learn any language. Engkoo has a core of translation data drawn from Microsoft-licensed dictionaries. That content is mixed with data from Web sites with parallel Chinese and English versions. When an Engkoo user types a word or sentence into the Web site's input bar, in either Chinese or English, the site draws on statistics from its data to translate it.
Isis - Protecting Children in Online Social Networks
Language Analysis Tool to Ascertain Age and Gender
The Engineer (United Kingdom) (06/22/10)
TextRunner Search
University of Washington researchers have developed an automated information extraction software engine that mines meaning out of more than 500 million Web pages, contributed by Google, by analyzing fundamental relationships between words. The project expands the scale of the TextRunner application in terms of the number of pages and the breadth of topics it can examine.
SMART - Statistical Multilingual Analysis for Retrieval and Translation
European researchers working on the Statistical Multilingual Analysis for Retrieval and Translation (SMART) project have developed technology that will enable machine translation using statistical analysis. SMART researchers were inspired by the Pascal Network of Excellence, which sought to develop cooperative ties among Europe's leaders in pattern analysis, statistical modeling, and computational learning.
SIL International
SIL International: Partners in Language Development. SIL serves language communities worldwide, building their capacity for sustainable language development by means of research, translation, training, and materials development.
Visual Dictionary of English - Teaching computers to recognize objects
We present a visualization of all the nouns in the English language arranged by semantic meaning. Each of the tiles in the mosaic is an arithmetic average of images relating to one of 53,464 nouns. The images for each word were obtained using Google's Image Search and other engines. A total of 7,527,697 images were used, each tile being the average of 140 images. The average reveals the dominant visual characteristics of each word. For some, the average turns out to be a recognizable image; for others the average is a colored blob.
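The averaging step described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the project's actual code: the function name `average_tile` is hypothetical, the images are assumed to be equally sized grayscale grids stored as nested lists, and three tiny toy images stand in for the roughly 140 images per noun.

```python
def average_tile(images):
    """Return the pixel-wise arithmetic mean of equally sized grayscale images.

    Each image is a list of rows; each row is a list of numeric pixel values.
    (Hypothetical helper, written only to illustrate the averaging idea.)
    """
    if not images:
        raise ValueError("need at least one image")
    height, width = len(images[0]), len(images[0][0])
    # Accumulate per-pixel sums across all images.
    sums = [[0.0] * width for _ in range(height)]
    for img in images:
        for y in range(height):
            for x in range(width):
                sums[y][x] += img[y][x]
    n = len(images)
    # Divide each sum by the image count to get the mean.
    return [[value / n for value in row] for row in sums]


# Toy data: three 2x2 "images" (assumed values, not from the project).
toy_images = [
    [[0, 30], [60, 90]],
    [[30, 60], [90, 120]],
    [[60, 90], [120, 150]],
]
tile = average_tile(toy_images)
print(tile)
```

When the source images share a common layout, the mean keeps that structure, which is why some tiles stay recognizable. When they do not, the pixel values wash out toward a uniform color.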
Rosetta Project
The Rosetta Project is The Long Now Foundation's first exploration into very long-term archiving. It serves as a means to focus attention on the problem of digital obsolescence, and ways we might address that problem through creative archival storage methods.
Read the Web
Carnegie Mellon University (CMU) researchers are developing the Never-Ending Language Learning (NELL) system, a computer that can master semantics by learning more like a human. NELL was provided with basic knowledge in various categories and connected to the Web with a mission to teach itself. "For all the advances in computer science, we still don't have a computer that can learn as humans do, cumulatively, over the long term," says CMU professor Tom M. Mitchell.
TONES
The aim of the project is to study and develop automated reasoning techniques for both offline and online tasks associated with ontologies, seen either in isolation or as a community of interoperating systems. It also aims to devise methodologies for deploying these techniques, both in advanced tools that support ontology design and management and in applications that help software agents operate with ontologies.
CMU-Cambridge Statistical Language Modeling Toolkit v2
Overview of the CMU SLM Toolkit, Rev 1.0