The Open American National Corpus is a roughly 15 million word subset of the ANC Second Release that is unrestricted in terms of usage and redistribution. Since 2006, the ANC project has committed to producing only open data. Therefore, the OANC and MASC are the only resources that the ANC project will continue to develop.
The philosophy behind the OANC and MASC is one of collaborative community development, wherein members of the community can freely use all of our data and resources without restriction (including commercial use), with the expectation that they will contribute derived data and annotations when possible. We also hope that the community will contribute additional data to be included in the OANC and/or MASC in the future.
At present, the ANC project has approximately 40-50 million words of open data on hand, which will be processed for distribution as funding allows.
The ANC project is a supporter of and contributor to the Linguistic Linked Open Data Cloud