Name Last Update
..
data Loading commit data...
resources Loading commit data...
.gitignore Loading commit data...
CONLL.ml Loading commit data...
CONLL2.ml Loading commit data...
LCGfields.ml Loading commit data...
README Loading commit data...
conllParser.ml Loading commit data...
freq_test.ml Loading commit data...
generate.ml Loading commit data...
info_sentences.txt Loading commit data...
info_sentences2.txt Loading commit data...
interpsInCorpus.ml Loading commit data...
makefile Loading commit data...
resources.ml Loading commit data...
struktura do NKJP.txt Loading commit data...
test_conll.ml Loading commit data...
test_conll2.ml Loading commit data...
types.ml Loading commit data...
validateTokenizer.ml Loading commit data...

README

ENIAMcorpora Version 1.0 :
-----------------------

ENIAMcorpora is a library that
- parses corpuses into CONLL format;
- converts dependencies structures;
- tests results of conversion.

Install
-------

ENIAMcorpora requires OCaml version 4.02.3 compiler
together with Xlib library version 3.2 or later,
ENIAMtokenizer library version 1.1, ENIAMmorphology library version 1.1,
ENIAMsubsyntax library version 1.1, ENIAMintegration library version 1.0,
ENIAM_LCGparser library version 2.0, ENIAM_LCGlexicon library version 1.0,
ENIAMsemValence library version 1.0.

In order to install type:

make install

by default, ENIAMcorpora is installed in the 'ocamlc -where'/eniam directory.
you can change it by editing the Makefile.

In order to test library type (graphviz installed required):
make test
./test

By default ENIAMcorpora looks for resources in /usr/share/eniam directory.
However this behaviour may be changed by setting end exporting ENIAM_RESOURCE_PATH
environment variable.

Credits
-------
Copyright © 2016 Wojciech Jaworski <wjaworski atSPAMfree mimuw dot edu dot pl>
Copyright © 2016 Daniel Oklesinski <oklesinski dot daniel atSPAMfree gmail dot com>
Copyright © 2016 Institute of Computer Science Polish Academy of Sciences

The library uses the following licensed resources:

NKJP1M: the manually annotated 1-million word subcorpus sampled
from texts of a subset of the National Corpus of Polish.
version 1.2

SGJP: Grammatical Dictionary of Polish, version 20151020
Copyright © 2007–2015 Zygmunt Saloni, Włodzimierz Gruszczyński, Marcin
Woliński, Robert Wołosz, Danuta Skowrońska

Licence
-------

This library is free software: you can redistribute it and/or modify
it under the terms of the GNU Lesser General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This library is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU Lesser General Public License for more details.

You should have received a copy of the GNU Lesser General Public License
along with this program.  If not, see <http://www.gnu.org/licenses/>.