folia

High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)

Provided tools & services

folia

Type
  • Software Library

Tool suite: FoLiA

The following closely related tools are in a tool suite together with folia:

  • Command-line Application
  • 9 - Proven: Technology complete and proven in practice by real users.
  • Active: The project has reached a stable, usable state and is being actively developed.

FoLiA tools 2.5.5

  •   KNAW Humanities Cluster & CLST, Radboud University
FoLiA-tools contains various Python-based command line tools for working with FoLiA XML (Format for Linguistic Annotation)
  • Annotating
  • https://w3id.org/nwo-research-fields#ComputationalLinguisticsandPhilology
  • Textual and linguistic corpora
  • annotation
  • computational linguistics
  • folia
  • nlp
  • search
  • BSD
  • Linux
  • macOS
  • Python
Created: 2011-01-14
Modified: 2024-02-05
  • Software Library
  • 9 - Proven: Technology complete and proven in practice by real users.
  • Active: The project has reached a stable, usable state and is being actively developed.

FoLiApy 2.5.9

  •   KNAW Humanities Cluster & CLST, Radboud University
An extensive library for processing FoLiA documents. FoLiA stands for Format for Linguistic Annotation and is a very rich XML-based format used by various Natural Language Processing tools.
  • Annotating
  • https://w3id.org/nwo-research-fields#ComputationalLinguisticsandPhilology
  • Textual and linguistic corpora
  • annotation
  • computational linguistics
  • folia
  • format
  • nlp
  • xml
  • BSD
  • Linux
  • macOS
  • Python
Created: 2010-05-27
Modified: 2024-02-05
  • Command-line Application
  • Active: The project has reached a stable, usable state and is being actively developed.

foliautils 0.20

Command-line utilities for working with the Format for Linguistic Annotation (FoLiA).
  • folia
  • linguistic annotation
  • natural language processing
  • nlp
  • xml
  • POSIX
  • Command-line Application
  • Software Library
  • Active: The project has reached a stable, usable state and is being actively developed.

libfolia 2.17

This is a C++ library for working with the Format for Linguistic Annotation (FoLiA).
  • folia
  • linguistic annotation
  • natural language processing
  • nlp
  • xml
  • POSIX
  • Web Application
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.

piereling 0.4

  •   KNAW Humanities Cluster & CLST, Radboud University
Piereling is a webservice and web-application to convert between a variety of document formats, mostly from and to FoLiA XML. It is intended for NLP pipelines.
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • webservice
  • nlp
  • computational_linguistics
  • rest
  • folia
  • conversion
  • BSD
  • Linux
  • macOS
  • Python
Created: 2019-10-18
Modified: 2023-11-01

Citation

You can cite this software using the following citation generated from its metadata:

(2020). folia 0.0.6. KNAW Humanities Cluster & CLST, Radboud University.

Logs & Reviews

Name
Automatic software metadata validation report for folia 0.0.6
Author
  • codemetapy validator using software.ttl
Date
2024-02-25 03:08:53
Review
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems

Validation of folia 0.0.6 was successful (score=3/5), but there are some warnings which should be addressed:

1. Warning: Documentation *SHOULD* be expressed (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)
2. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
3. Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)
4. Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)
5. Info: A research domain *SHOULD* be expressed as a category using the NWO Research Fields vocabulary, if applicable (This is missing in the metadata)
6. Info: A research activity *SHOULD* be expressed as a category using the TaDiRaH vocabulary (This is missing in the metadata)
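The findings above follow a regular "number, severity, message" pattern, so they can be tallied mechanically. The following is a minimal sketch of that, assuming the report text is available as a plain string; the function name `parse_report` and the regular expression are illustrative and not part of codemetapy.

```python
import re
from collections import Counter

# The six findings, copied verbatim from the validation report.
REPORT = """\
1. Warning: Documentation *SHOULD* be expressed (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)
2. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
3. Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)
4. Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)
5. Info: A research domain *SHOULD* be expressed as a category using the NWO Research Fields vocabulary, if applicable (This is missing in the metadata)
6. Info: A research activity *SHOULD* be expressed as a category using the TaDiRaH vocabulary (This is missing in the metadata)
"""

# Hypothetical pattern: "<index>. <Severity>: <message>"
FINDING_RE = re.compile(r"^(\d+)\.\s+(\w+):\s+(.*)$")


def parse_report(text):
    """Return a list of (index, severity, message) tuples."""
    findings = []
    for line in text.splitlines():
        match = FINDING_RE.match(line.strip())
        if match:
            findings.append((int(match.group(1)), match.group(2), match.group(3)))
    return findings


findings = parse_report(REPORT)
counts = Counter(severity for _, severity, _ in findings)
print(counts)
```

For this report the tally is one warning and five informational notices.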
Rating
★ ★ ★ ☆ ☆
(log file starts at Sun Feb 25 03:08:33 UTC 2024)

[harvester info] --> Processing folia-rust (https://github.com/proycon/folia-rust) [Sun Feb 25 03:08:33 UTC 2024]

[harvester info] Git updating cached clone of https://github.com/proycon/folia-rust...

[harvester info] Found release v0.0.6

[harvester info] Using 'v0.0.6'

[harvester info] Git reference: v0.0.6

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/folia-rust for harvestable resources...

[harvester info] found Cargo.toml (rust) for folia-rust, converting to codemeta

[harvester info] Looking for license....

[harvester info] Found license GPL-3.0-only

[harvester info] Getting contributors from git...

[harvester info] Getting top contributor from git...

[harvester info] Git top contributor Maarten van Gompel <proycon@anaproy.nl> will be assigned as author (and maintainer) if none are found in the metadata

[harvester info] Extracting last and first commit date from git log....

[harvester info] Date created: 2019-06-08T21:34:53Z+0200, date modified: 2020-11-16T14:24:33Z+0100

[harvester info] Querying Github/GitLab API (https://github.com/proycon/folia-rust)

[harvester info] Adding URL for found README: README.md

[harvester info] Found releaseNotes

[harvester info] Querying Zenodo API for DOI (access token provided)...

[harvester info] Looking for TRL information in README.md...

[harvester info] Looking for repostatus information in README.md...

[harvester info] Found repostatus https://www.repostatus.org/#active

[harvester info] Looking for continuous integration information in README.md...

[harvester info] Found CI https://travis-ci.com/proycon/folia

[harvester info] Looking for documentation links in README.md...

[harvester info] Scraping title from https://docs.rs/folia/

[harvester info] Found documentation at https://docs.rs/folia/ : "name": "folia - Rust",

[harvester info] Scraping title from https://folia.readthedocs.io/en/latest/implementations.html

[harvester info] Found documentation at https://folia.readthedocs.io/en/latest/implementations.html : "name": "Implementations — FoLiA: Format for Linguistic Annotation v2.0 (rev 9.0) documentation",

[harvester info] Falling back to git tag (v0.0.6) if no version number is specified...

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#inactive

[harvester info] Looking for repostatus information in README.md in master branch...

[harvester info] Found repostatus (master branch) https://www.repostatus.org/#inactive

[harvester info] Setting group FoLiA

[harvester info] Reconciliating: codemetapy  --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "folia-rust" --codeRepository "https://github.com/proycon/folia-rust" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-version.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/50-documentation.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/43-releasenotes.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/32-contributors.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/23-rust.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/12-ci.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/11-repostatus.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-repostatus.folia-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.folia-rust.codemeta.json 

-- begin log --

Passed 15 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-version.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/99-repostatus.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/50-documentation.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/43-releasenotes.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/32-contributors.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/23-rust.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/12-ci.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/11-repostatus.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-repostatus.folia-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.folia-rust.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://tools.clariah.nl/folia-rust

Processing source #1 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-version.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 1 new triples, total is now 2

Processing source #2 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 1 new triples, total is now 3

Processing source #3 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.folia-rust.codemeta.json

    Found main resource with URI https://tools.clariah.nl/folia-rust.topcontributor/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 8 new triples, total is now 10

Processing source #4 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/50-documentation.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 8 new triples, total is now 18

Processing source #5 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/43-releasenotes.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 2 new triples, total is now 20

Processing source #6 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 1 new triples, total is now 21

Processing source #7 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.folia-rust.codemeta.json

    Found main resource with URI https://tools.clariah.nl/folia-rust/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/author (https://tools.clariah.nl/stub/H214009fa050fed43 -> https://tools.clariah.nl/stub/H-1645259754a1b733)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 24 new triples, total is now 39

Processing source #8 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/dateCreated (2019-06-08T19:36:18Z -> 2019-06-08T21:34:53Z+0200)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/dateModified (2022-09-13T13:28:28Z -> 2020-11-16T14:24:33Z+0100)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 2 new triples, total is now 39

Processing source #9 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/32-contributors.folia-rust.codemeta.json

    Found main resource with URI https://tools.clariah.nl/folia-rust.contributors/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 8 new triples, total is now 40

Processing source #10 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> GPL-3.0-only)

[CODEMETA CORRECTION (https://tools.clariah.nl/folia-rust)] automatically converting license to spdx URI

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 1 new triples, total is now 40

Processing source #11 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/23-rust.folia-rust.codemeta.json

    Found main resource with URI https://tools.clariah.nl/cargo.toml/0.0.6

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/author (https://tools.clariah.nl/stub/H-1645259754a1b733 -> https://tools.clariah.nl/stub/H214009fa050fed43)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/description (FoLiA library for rust (alpha) -> High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation))

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/keywords (folia -> annotation)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/keywords (rust -> annotation)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> http://spdx.org/licenses/GPL-3.0-or-later)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/name (folia-rust -> folia)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old https://codemeta.github.io/terms/readme (https://github.com/proycon/folia-rust/blob/v0.0.6//README.md -> https://tools.clariah.nl/README.md)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/softwareHelp (https://docs.rs/folia/ -> https://docs.rs/folia)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/softwareHelp (https://folia.readthedocs.io/en/latest/implementations.html -> https://docs.rs/folia)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old http://schema.org/version (v0.0.6 -> 0.0.6)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 79 new triples, total is now 100

Processing source #12 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/12-ci.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 1 new triples, total is now 101

Processing source #13 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/11-repostatus.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old https://codemeta.github.io/terms/developmentStatus (https://www.repostatus.org/#inactive -> https://www.repostatus.org/#active)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 1 new triples, total is now 101

Processing source #14 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-repostatus.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] overriding old https://codemeta.github.io/terms/developmentStatus (https://www.repostatus.org/#active -> https://www.repostatus.org/#inactive)

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 1 new triples, total is now 101

Processing source #15 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.folia-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/folia-rust

[CODEMETA COMPOSITION (https://tools.clariah.nl/folia-rust)] processed 1 new triples, total is now 102

Remapping URI to (possibly) new identifier and version component: https://tools.clariah.nl/folia-rust -> https://tools.clariah.nl/folia-rust/0.0.6

[CODEMETA VALIDATION (folia-rust)] done

[CODEMETA ENRICHMENT (folia-rust)] Guessing interface type https://w3id.org/software-types#SoftwareLibrary based on clues

[CODEMETA ENRICHMENT (folia-rust)] adding affiliation(s) of first author as producer

VALIDATION https://tools.clariah.nl/folia-rust/0.0.6 #1: Warning: Documentation *SHOULD* be expressed (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)

VALIDATION https://tools.clariah.nl/folia-rust/0.0.6 #2: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/folia-rust/0.0.6 #3: Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/folia-rust/0.0.6 #4: Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/folia-rust/0.0.6 #5: Info: A research domain *SHOULD* be expressed as a category using the NWO Research Fields vocabulary, if applicable (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/folia-rust/0.0.6 #6: Info: A research activity *SHOULD* be expressed as a category using the TaDiRaH vocabulary (This is missing in the metadata)

-- end log --

[harvester info] Output written to /tmp/out/folia-rust.codemeta.json

[harvester info] <-- Finished processing folia-rust (https://github.com/proycon/folia-rust) [Sun Feb 25 03:08:53 UTC 2024]
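The log above suggests that the sources are merged in the order they are passed on the command line, and that for single-valued properties a later source replaces the value set by an earlier one (the "overriding old ..." lines). The sketch below illustrates that behaviour with plain dictionaries. It is not codemetapy's actual implementation, which operates on RDF triples; the function name `compose` and the dictionary layout are assumptions made for illustration. The sample values are taken from the log.

```python
def compose(sources):
    """Merge (name, properties) pairs in order; later values win.

    Returns the merged properties and a list of recorded overrides
    as (property, old_value, new_value, source_name) tuples.
    """
    merged = {}
    overrides = []
    for name, props in sources:
        for key, value in props.items():
            if key in merged and merged[key] != value:
                overrides.append((key, merged[key], value, name))
            merged[key] = value
    return merged, overrides


# A subset of the 15 sources, in the order the log processes them.
sources = [
    ("99-version", {"version": "v0.0.6"}),
    ("99-repostatus", {"developmentStatus": "https://www.repostatus.org/#inactive"}),
    ("23-rust", {"version": "0.0.6"}),
    ("11-repostatus", {"developmentStatus": "https://www.repostatus.org/#active"}),
    ("05-repostatus", {"developmentStatus": "https://www.repostatus.org/#inactive"}),
]

merged, overrides = compose(sources)
for key, old, new, name in overrides:
    print(f"{name}: overriding old {key} ({old} -> {new})")
print(merged)
```

Under this reading, the development status flips from inactive to active and back to inactive, which matches the "Inactive" status shown in the metadata properties.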

        

Metadata Properties

Version
0.0.6 (release notes)
Interface types
  • Software Library
Software website
Source code repository
https://github.com/proycon/folia-rust
Category
  • science
  • text-processing
Keywords
  • annotation
  • linguistics
  • nlp
  • text-processing
  • xml
Development Status
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.
Issue Tracker (Support)
https://github.com/proycon/folia-rust/issues
Documentation
https://docs.rs/folia
License
GPL-3.0-or-later
Author(s)
Maintainer(s)
Contributor(s)
Producer
  •   KNAW Humanities Cluster & CLST, Radboud University
Programming Language
  • Rust
Continuous Integration Tests
https://travis-ci.com/proycon/folia
Software dependencies
  • chrono
  • clap
  • hex
  • libc
  • matches
  • quick-xml
  • rand
  • serde
  • serde_derive
  • strum
  • strum_macros
Metadata validation
★ ★ ★ ☆ ☆
Created
2019-06-08 21:34:53 +0200
Last modified
2020-11-16 14:24:33 +0100
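The harvester log records these two dates as `2019-06-08T21:34:53Z+0200` and `2020-11-16T14:24:33Z+0100`, a form that carries both a `Z` suffix and a numeric offset and is therefore not a valid ISO 8601 timestamp. The sketch below shows one way to normalise such values, assuming (as the rendering above suggests) that the numeric offset is the intended one and the `Z` is stray. The helper name `parse_harvester_timestamp` is hypothetical.

```python
from datetime import datetime, timezone


def parse_harvester_timestamp(raw):
    """Parse a timestamp such as '2019-06-08T21:34:53Z+0200'.

    Assumption: the 'Z' directly before the numeric offset is stray,
    so it is dropped and the offset is kept.
    """
    cleaned = raw.replace("Z+", "+").replace("Z-", "-")
    return datetime.strptime(cleaned, "%Y-%m-%dT%H:%M:%S%z")


created = parse_harvester_timestamp("2019-06-08T21:34:53Z+0200")
modified = parse_harvester_timestamp("2020-11-16T14:24:33Z+0100")

# Convert to UTC for unambiguous comparison or storage.
print(created.astimezone(timezone.utc).isoformat())
print(modified.astimezone(timezone.utc).isoformat())
```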