search/data aggregation

SSWL (Syntactic Structures of the World's Languages)

SSWL (Syntactic Structure of the World's Languages) is a open-ended database of syntactic, morphological and semantic properties. Each language is characterized by a set of property-value pairs (e.g., Object Verb: Yes), and examples that illustrate these property value pairs. A rich variety of search functions are available, as well as mapping and the creation of similarity trees. The database is open-ended in the sense that (a) new language experts may sign up to add new languages, and (b) new properties may be added.

RELISH-Symposium „Rendering Endangered Lexicons Interoperable through Standards Harmonization”, Frankfurt, October 10, 2011 “RELISH meets LOEWE”

The RELISH project promotes language-oriented research by addressing a two-pronged problem: (1) the lack of harmonization between digital standards for lexical information in Europe and America, and (2) the lack of interoperability among existing lexicons of endangered languages, in particular those created with the Shoebox lexicon building software. The cooperation partners in the RELISH project are the University of Frankfurt (FRA), the Max Planck Institute for Psycholinguistics (MPI), and Eastern Michigan University, the host of the Linguist List (ILIT).

"Linked Data in Linguistics" at DGfS 2012

Linked Data in Linguistics
Linguists from all disciplines produce more and more data and share the challenge how to make this data accessible to other researchers in their field and beyond. This does not only concern the general availability of data, but also the representation of the structure of the data. Linked Data is one paradigm which can be employed to tackle this task.
We are happy to announce the workshop "Linked Data in Linguistics" at the annual meeting of the German Linguistic Society (Deutsche Gesellschaft für Sprachwissenschaft, DGfS) taking place March 7-9, 2012 in Frankfurt a.M., Germany.

Data provenance and data aggregation

Peter Austin, over at Endangered Languages and Cultures, has initiated a discussion on citation practices (with James McElvenny also participating), and it was prompted (at least partly) by some data I have had a role in processing as part of the LEGO project.

Interdisciplinary Centre for Social and Language Documentation in Portugal

The Centro Interdisciplinar de Documentação Linguística e Social (CIDLeS) is an interdisciplinary non-profit centre dedicated to the documentation and preservation of the linguistic (and cultural) heritage in Europe. It was founded in January 2010 as a result of the work of a number of researchers at the Institute of General Linguistics and Language Typology at the University of Munich and at the Department of Portuguese Studies at the Universidade Nova de Lisboa.

A Grand Challenge for Linguistics: Scaling Up and Integrating Models

In response to NSF's call for White Papers in the SBE 2020 Initiative, Jeff Good and I have submitted a paper outlining our take on Cyberinfrastructure for Linguistics, why its necessary, and how it can come about. The abstract:

Abney & Bird's Grand Challenge: The Human Language Project

Steven Abney and Steven Bird published a provocative paper (.pdf) at ACL 2010 calling on the computational linguistics community to work to create a "Universal Corpus", an undertaking that they compare in both scale and potential impact to the Human Genome Project. Here is the abstract:

RELISH Meeting in Nijmegen

On 4–5 August, the RELISH project held a workshop on lexicon tools and lexical standards. Slides from many of the presentations are posted on the workshop site.

Workshop on Advanced Corpus Solutions, PACLIC 24

Call for papers: http://www.hf.uio.no/tekstlab/paclic/index.html

Submission deadline: June 14, 2010
Workshop date: November 4, 2010

This workshop invites papers on advances in corpus types and corpus tools in support of linguistic research.

NSF Software Infrastructure for Sustained Innovation (SI**2)

On March 16, the National Science Foundation announced the Software Infrastructure for Sustained Innovation (SI**2) Program Solicitation 10-551 at http://www.nsf.gov/pubs/2010/nsf10551/nsf10551.pdf. This is an NSF-wide solicitation, led by the Office of Cyberinfrastructure (OCI).

Syndicate content
Powered by Drupal, an open source content management system