The ANC2Go beta
Note: the the ANC2Go application is still undergoing
testing and development and
the output may not always be well formed. Please contact
anc@cs.vassar.edu if you encounter any problems.
Instructions
The ANC2Go application can be used to generate various output
formats for the OANC, MASC, and Wordnet (coming soon) corpora.
- Enter your email address. Processing the corpus may take some
time and you will be emailed a link to the download when processing has
completed.
- Select one of the available corpora. Only a single corpus can be
processed at a time.
- Click the Browse button to select the directories in
the corpus to be included. You can include as much or as little of the
corpus as you would like (only directories can be selected, not
individual files).
- Select your desired output format and the annotations to
include. Not all annotations are available for all output formats.
- Click the Process button. Processing will generally take
a few minutes depending on server load and the
number of files you have selected.
NOTES
- Not all annotation types can be freely mixed and matched. In
particular, the Penn Tree Bank annotations consist of a well formed
tree, which may or may not be so well formed after merging with other
annotation types into XML. The ANC2Go applications will resolve
overlapping annotations with one of the strategies below, but these are
intended for simple cases.
NOTE: The ANC2Go service is currently offline for maintenance and upgrades.
Please contact the
webmaster if you have any comments
or questions regarding the ANC website.
Copyright © 2002-2010 American
National Corpus Project. All rights reserved.