Alpino-Webservice

Alpino is a dependency parser for Dutch, developed in the context of the PIONIER Project Algorithms for Linguistic Processing, developed by Gertjan van Noord at the University of Groningen. This is the webservice for it. You can upload either tokenised or untokenised files (which will be automatically tokenised for you using ucto), the output will consist of a zip file containing XML files, one for each sentence in the input document.

Provided tools & services

Alpino Webservice

Alpino is a dependency parser for Dutch, developed in the context of the PIONIER Project Algorithms for Linguistic Processing, developed by Gertjan van Noord at the University of Groningen. You can upload either tokenised or untokenised files (which will be automatically tokenised for you using ucto), the output will consist of a zip file containing XML files, one for each sentence in the input document.
Type
  • Web Application
Version
2.4.1
Service Provider
      Rijksuniversiteit Groningen (backend), Radboud Universiteit Nijmegen (webservice)
Input data
Name
*.txt
Description
Plaintext document (untokenised)
Type
DigitalDocument
Encoding Format
text/plain
Name
*.tok
Description
Plaintext tokenised input, one sentence per line
Type
DigitalDocument
Encoding Format
text/plain
Output data
Name
*.folia.xml
Description
FoLiA XML Output
Type
TextDigitalDocument
Encoding Format
text/xml
Name
*.alpinoxml.zip
Description
Alpino XML output (XML files per sentence)
Type
DigitalDocument
Encoding Format
application/zip
Name
*.tok
Description
Plaintext tokenised output, one sentence per line
Type
DigitalDocument
Encoding Format
text/plain
Name
error.log
Description
Log file with (standard) error output
Type
DigitalDocument
Encoding Format
text/plain

Tool suite: Alpino

The following closely related tools are in a tool suite together with Alpino-Webservice:

  • Command-line Application
  • Complete: The technology is complete, stable and deployed in production scenarios for end-users
  • Active: The project has reached a stable, usable state and is being actively developed.

Alpino 0.0.0

Alpino parser and related tools for Dutch [view more]
  • Linguistics
  • nwo:ComputationalLinguisticsandPhilology
  • Software for humanities
  • Structural Analysis
  • Docker
  • Linux

Citation

You can cite this software using the following citation generated from its metadata:

(2024) Alpino-Webservice 2.4.1 .
  • KNAW Humanities Cluster & CLST, Radboud University
.

Logs & Reviews

Name
Automatic software metadata validation report for Alpino-Webservice 2.4.1
Author
  • codemetapy validator using software.ttl
Date
2024-12-09 03:00:16
Review
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems

Validation of Alpino-Webservice 2.4.1 was successful (score=3/5), but there are some warnings which should be addressed:

1. Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)
2. Warning: Documentation *SHOULD* be expressed (This is missing in the metadata)
3. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
4. Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)
5. Info: A research domain *SHOULD* be expressed as a category using the NWO Research Fields vocabulary, if applicable (This is missing in the metadata)
6. Info: A research activity *SHOULD* be expressed as a category using the TaDiRaH vocabulary (This is missing in the metadata)
Rating
★ ★ ★ ☆ ☆
(log file starts at Mon Dec  9 03:00:00 UTC 2024)

[harvester info] --> Processing alpino-service (https://github.com/proycon/alpino_clam_webservice) [Mon Dec  9 03:00:00 UTC 2024]

[harvester info] Git updating cached clone of https://github.com/proycon/alpino_clam_webservice...

[harvester info] Found release v2.4.1

[harvester info] Using 'v2.4.1'

[harvester info] Git reference: v2.4.1

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/alpino-service for harvestable resources...

[harvester info] found codemeta-harvest.json for alpino-service (md5sum dc9e71f716c2bf61ee5f99729991134a); values in here take precendence over (override) those in later detection stages

[harvester info] found python setup for alpino-service, converting to codemeta

[harvester info] Looking for license....

[harvester info] No license file found

[harvester info] Getting contributors from git...

[harvester info] No git contributors found

[harvester info] Getting top contributor from git...

[harvester info] Git top contributor  will be assigned as author (and maintainer) if none are found in the metadata

[harvester info] Extracting last and first commit date from git log....

[harvester info] Date created: 2015-09-08T23:41:30Z+0200, date modified: 2024-10-17T17:01:23Z+0200

[harvester info] Querying Github/GitLab API (https://github.com/proycon/alpino_clam_webservice)

[harvester info] Adding URL for found README: README.md

[harvester info] Found releaseNotes

[harvester info] Querying Zenodo API for DOI (access token provided)...

[harvester info] Looking for TRL information in README.md...

[harvester info] Looking for repostatus information in README.md...

[harvester info] Found repostatus https://www.repostatus.org/#active

[harvester info] Looking for continuous integration information in README.md...

[harvester info] Looking for documentation links in README.md...

[harvester info] Falling back to git tag (v2.4.1) if no version number is specified...

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#active

[harvester info] Looking for repostatus information in README.md in master branch...

[harvester info] Found repostatus (master branch) https://www.repostatus.org/#active

[harvester info] Setting group Alpino

[harvester info] Reconciliating: codemetapy  --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "alpino-service" --codeRepository "https://github.com/proycon/alpino_clam_webservice" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-version.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/43-releasenotes.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/20-python.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/11-repostatus.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/10-harvest.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-repostatus.alpino-service.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.alpino-service.codemeta.json 

-- begin log --

Passed 12 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-version.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/99-repostatus.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/43-releasenotes.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/20-python.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/11-repostatus.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/10-harvest.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-repostatus.alpino-service.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.alpino-service.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://tools.clariah.nl/alpino-service

Processing source #1 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-version.alpino-service.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (https://tools.clariah.nl/alpino-service)] processed 1 new triples, total is now 2

Processing source #2 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.alpino-service.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (https://tools.clariah.nl/alpino-service)] processed 1 new triples, total is now 3

Processing source #3 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.alpino-service.codemeta.json

    Found main resource with URI https://tools.clariah.nl/alpino-service.topcontributor/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (https://tools.clariah.nl/alpino-service)] processed 1 new triples, total is now 3

Processing source #4 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/43-releasenotes.alpino-service.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (https://tools.clariah.nl/alpino-service)] processed 2 new triples, total is now 5

Processing source #5 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.alpino-service.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (https://tools.clariah.nl/alpino-service)] processed 1 new triples, total is now 6

Processing source #6 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.alpino-service.codemeta.json

    Found main resource with URI https://tools.clariah.nl/alpino_clam_webservice/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (https://tools.clariah.nl/alpino-service)] processed 19 new triples, total is now 24

Processing source #7 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.alpino-service.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (https://tools.clariah.nl/alpino-service)] overriding old http://schema.org/dateCreated (2015-09-08T21:43:45Z -> 2015-09-08T23:41:30Z+0200)

[CODEMETA COMPOSITION (https://tools.clariah.nl/alpino-service)] overriding old http://schema.org/dateModified (2024-10-17T15:08:58Z -> 2024-10-17T17:01:23Z+0200)

[CODEMETA COMPOSITION (https://tools.clariah.nl/alpino-service)] processed 2 new triples, total is now 24

Processing source #8 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/20-python.alpino-service.codemeta.json

    Found main resource with URI https://tools.clariah.nl/alpino-webservice/2.4.1

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (alpino-webservice)] overriding old http://schema.org/author (https://tools.clariah.nl/stub/H07db6c2bad6c57ef -> https://tools.clariah.nl/stub/H-7f72eb4c6f7405de)

[CODEMETA COMPOSITION (alpino-webservice)] overriding old http://schema.org/description (A CLAM-powered webservice for Alpino, a dependency parser for Dutch -> Alpino is a dependency parser for Dutch, developed in the context of the PIONIER Project Algorithms for Linguistic Processing, developed by Gertjan van Noord at the University of Groningen. This is the webservice for it. You can upload either tokenised or untokenised files (which will be automatically tokenised for you using ucto), the output will consist of a zip file containing XML files, one for each sentence in the input document.)

[CODEMETA COMPOSITION (alpino-webservice)] overriding old http://schema.org/name (alpino_clam_webservice -> Alpino-Webservice)

[CODEMETA COMPOSITION (alpino-webservice)] overriding old http://schema.org/version (v2.4.1 -> 2.4.1)

[CODEMETA COMPOSITION (alpino-webservice)] processed 69 new triples, total is now 83

Processing source #9 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/11-repostatus.alpino-service.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (alpino-webservice)] processed 1 new triples, total is now 83

Processing source #10 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/10-harvest.alpino-service.codemeta.json

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (alpino-webservice)] overriding old http://schema.org/softwareRequirements (https://tools.clariah.nl/dependency/folia-tools -> https://tools.clariah.nl/stub/H-24a0653e2d9eed5e)

[CODEMETA COMPOSITION (alpino-webservice)] overriding old http://schema.org/softwareRequirements (https://tools.clariah.nl/dependency/natsort -> https://tools.clariah.nl/stub/H-24a0653e2d9eed5e)

[CODEMETA COMPOSITION (alpino-webservice)] overriding old http://schema.org/softwareRequirements (https://tools.clariah.nl/dependency/clam-ge-3-1-4 -> https://tools.clariah.nl/stub/H-24a0653e2d9eed5e)

[CODEMETA COMPOSITION (alpino-webservice)] processed 8 new triples, total is now 88

Processing source #11 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-repostatus.alpino-service.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (alpino-webservice)] processed 1 new triples, total is now 88

Processing source #12 of 12

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.alpino-service.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/alpino-service

[CODEMETA COMPOSITION (alpino-webservice)] processed 1 new triples, total is now 89

Remapping URI to (possibly) new identifier and version component: https://tools.clariah.nl/alpino-service -> https://tools.clariah.nl/alpino-service/2.4.1

[CODEMETA VALIDATION (alpino-service)] done

[CODEMETA ENRICHMENT (alpino-service)] Guessing interface type http://schema.org/WebAPI based on clues

[CODEMETA ENRICHMENT (alpino-service)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (alpino-service)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (alpino-service)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (alpino-service)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (alpino-service)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (alpino-service)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (alpino-service)] adding author https://tools.clariah.nl/person/maarten-van-gompel as contributor

[CODEMETA ENRICHMENT (alpino-service)] adding affiliation(s) of first author as producer

VALIDATION https://tools.clariah.nl/alpino-service/2.4.1 #1: Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/alpino-service/2.4.1 #2: Warning: Documentation *SHOULD* be expressed (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/alpino-service/2.4.1 #3: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/alpino-service/2.4.1 #4: Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/alpino-service/2.4.1 #5: Info: A research domain *SHOULD* be expressed as a category using the NWO Research Fields vocabulary, if applicable (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/alpino-service/2.4.1 #6: Info: A research activity *SHOULD* be expressed as a category using the TaDiRaH vocabulary (This is missing in the metadata)

-- end log --

[harvester info] Output written to /tmp/out/alpino-service.codemeta.json

[harvester info] Harvesting remote service URL https://webservices.cls.ru.nl/alpino for alpino-service: codemetapy  --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl -O "/tmp/codemeta-harvester.cache//tmp/alpino-service.codemeta.json" "/tmp/out/alpino-service.codemeta.json" "https://webservices.cls.ru.nl/alpino"

[harvester info] <-- Finished processing alpino-service (https://github.com/proycon/alpino_clam_webservice) [Mon Dec  9 03:00:20 UTC 2024]

        

Metadata Properties

Version
2.4.1 (release notes)
Interface types
  • Web Application
Software website
Source code repository
 https://github.com/proycon/alpino_clam_webservice  Stars are an indicator of the popularity of this project on GitHub
Category
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
Keywords
  • dependency parsing
  • folia
  • linguistics
  • nlp
  • syntax
Development Status
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Active: The project has reached a stable, usable state and is being actively developed.
Issue Tracker (Support)
https://github.com/proycon/alpino_clam_webservice/issues  The number of open issues on the issue tracker  The number of closes issues on the issue tracker
Documentation
License
Author(s)
Maintainer(s)
Contributor(s)
Producer
  •   KNAW Humanities Cluster & CLST, Radboud University
Programming Language
  • Python
Runtime Platform
  • Python 3
  • Python 3.10
  • Python 3.6
  • Python 3.7
  • Python 3.8
  • Python 3.9
Operating System
  • BSD
  • Linux
  • macOS
Software dependencies
  • Alpino
  • ucto
Metadata validation
★ ★ ★ ☆ ☆
Created
2015-09-08 23:41:30 +0200
Last modified
2024-10-17 17:01:23 +0200  Last commit (main branch). Gives an indication of project development activity and rough indication of how up-to-date the latest release is.  Number of commits since the last release. Gives an indication of project development activity and rough indication of how up-to-date the latest release is.