python-frog

Python binding to Frog, an NLP suite for Dutch doing part-of-speech tagging, lemmatisation, morphological analysis, named-entity recognition, shallow parsing, and dependency parsing.

Tool suite: Frog

The following closely related tools are in a tool suite together with python-frog:

  • Command-line Application
  • Software Library
  • 9 - Proven: Technology complete and proven in practice by real users.
  • Active: The project has reached a stable, usable state and is being actively developed.
thumbnail/logo

Frog 0.33

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. It performs automatic linguistic enrichment such as part of speech tagging, lemmatisation, named entity recognition, shallow parsing, dependency parsing and morphological analysis. All NLP modules are based on TiMBL. [view more]
  • Annotating
  • Contextualizing
  • Linguistics
  • Named Entity Recognition
  • POS-Tagging
  • Segmenting
  • Tagging
  • Textual and content analysis
  • Tree-Tagging
  • dependency parsing
  • dutch
  • lemma
  • lemmatisation
  • natural language processing
  • ner
  • nlp
  • parser
  • part-of-speech tagging
  • pos
  • shallow parsing
  • tagger
  • Bsd
  • Linux
  • Macos
Created: 2011-03-31
Modified: 2023-12-05
  • Web Application
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Active: The project has reached a stable, usable state and is being actively developed.

Frog-Webservice 2.7

Frog is a suite containing a tokeniser, Part-of-Speech tagger, lemmatiser, morphological analyser, shallow parser, and dependency parser for Dutch. This is the webservice for it, for both humans and machines. [view more]
  • Annotating
  • Contextualizing
  • Linguistics
  • Named Entity Recognition
  • POS-Tagging
  • Segmenting
  • Tagging
  • Textual and content analysis
  • Tree-Tagging
  • clam webservice rest nlp computational_linguistics rest
  • Bsd
  • Linux
  • Macos
  • Python
Created: 2022-02-17
Modified: 2023-12-05
  • Active: The project has reached a stable, usable state and is being actively developed.

toad v0.8

Toad: Trainer Of All Data, the Frog training collection [view more]
Created: 2015-12-08
Modified: 2023-02-22

Citation

You can cite this software using the following citation generated from its metadata:

Logs & Reviews

Name
Automatic software metadata validation report for python-frog 0.6.10
Author
  • codemetapy validator using software.ttl
Date
2024-06-15 03:15:32
Review
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems

Validation of python-frog 0.6.10 was successful (score=4/5), but there are some remarks which you may or may not want to address:

1. Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (This is missing in the metadata)
2. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
Rating
★ ★ ★ ★ ☆
(log file starts at Sat Jun 15 03:15:20 UTC 2024)

[harvester info] --> Processing python-frog (https://github.com/proycon/python-frog) [Sat Jun 15 03:15:20 UTC 2024]

[harvester info] Git updating cached clone of https://github.com/proycon/python-frog...

[harvester info] Found release v0.6.10

[harvester info] Using 'v0.6.10'

[harvester info] Git reference: v0.6.10

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/python-frog for harvestable resources...

[harvester info] found codemeta-harvest.json for python-frog (md5sum 183f4daae6e1c0854dd236fdca078749); values in here take precendence over (override) those in later detection stages

[harvester info] found python setup for python-frog, converting to codemeta

[harvester info] Looking for license....

[harvester info] Found license GPL-3.0-only

[harvester info] Getting contributors from git...

[harvester info] No git contributors found

[harvester info] Getting top contributor from git...

[harvester info] Git top contributor  will be assigned as author (and maintainer) if none are found in the metadata

[harvester info] Extracting last and first commit date from git log....

[harvester info] Date created: 2014-09-07T22:33:37Z+0200, date modified: 2023-12-05T15:47:42Z+0100

[harvester info] Querying Github/GitLab API (https://github.com/proycon/python-frog)

[harvester info] Adding URL for found README: README.rst

[harvester info] Found releaseNotes

[harvester info] Querying Zenodo API for DOI (access token provided)...

[harvester info] Found DOI https://doi.org/10.5281/zenodo.10262701

[harvester info] Converting README.rst to README.md

[harvester info] Looking for TRL information in README.md...

[harvester info] Looking for repostatus information in README.md...

[harvester info] Found repostatus https://www.repostatus.org/#active

[harvester info] Looking for continuous integration information in README.md...

[harvester info] Looking for documentation links in README.md...

[harvester info] Scraping title from http://folia.readthedocs.io/

[harvester info] Found documentation at http://folia.readthedocs.io/ : "name": "FoLiA: Format for Linguistic Annotation - Documentation and Reference Guide — FoLiA: Format for Linguistic Annotation v2.0 (rev 9.0) documentation",

[harvester info] Falling back to git tag (v0.6.10) if no version number is specified...

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#active

[harvester info] Looking for repostatus information in README.rst in master branch...

[harvester info] Found repostatus (master branch) https://www.repostatus.org/#active

[harvester info] Setting group Frog

[harvester info] Reconciliating: codemetapy  --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "python-frog" --codeRepository "https://github.com/proycon/python-frog" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-version.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/50-documentation.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/43-releasenotes.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/20-python.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/11-repostatus.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/10-harvest.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-repostatus.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-doi.python-frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.python-frog.codemeta.json 

-- begin log --

Passed 15 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-version.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/99-repostatus.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/50-documentation.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/43-releasenotes.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/20-python.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/11-repostatus.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/10-harvest.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-repostatus.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-doi.python-frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.python-frog.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://tools.clariah.nl/python-frog

Processing source #1 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-version.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] processed 1 new triples, total is now 2

Processing source #2 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] processed 1 new triples, total is now 3

Processing source #3 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.python-frog.codemeta.json

    Found main resource with URI https://tools.clariah.nl/python-frog.topcontributor/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] processed 1 new triples, total is now 3

Processing source #4 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/50-documentation.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] processed 4 new triples, total is now 7

Processing source #5 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/43-releasenotes.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] processed 2 new triples, total is now 9

Processing source #6 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] processed 1 new triples, total is now 10

Processing source #7 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.python-frog.codemeta.json

    Found main resource with URI https://tools.clariah.nl/python-frog/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] processed 21 new triples, total is now 30

Processing source #8 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] overriding old http://schema.org/dateCreated (2014-09-07T20:32:31Z -> 2014-09-07T22:33:37Z+0200)

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] overriding old http://schema.org/dateModified (2023-12-05T15:32:22Z -> 2023-12-05T15:47:42Z+0100)

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] processed 2 new triples, total is now 30

Processing source #9 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> GPL-3.0-only)

[CODEMETA CORRECTION (https://tools.clariah.nl/python-frog)] automatically converting license to spdx URI

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] processed 1 new triples, total is now 30

Processing source #10 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/20-python.python-frog.codemeta.json

    Found main resource with URI https://tools.clariah.nl/python-frog/0.6.10

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (python-frog)] overriding old http://schema.org/author (https://tools.clariah.nl/stub/H2f337305d29c8d9c -> https://tools.clariah.nl/stub/H-10ab83df435780f9)

[CODEMETA COMPOSITION (python-frog)] overriding old http://schema.org/description (Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser) -> Python binding to Frog, an NLP suite for Dutch doing part-of-speech tagging, lemmatisation, morphological analysis, named-entity recognition, shallow parsing, and dependency parsing.)

[CODEMETA COMPOSITION (python-frog)] overriding old http://schema.org/version (v0.6.10 -> 0.6.10)

[CODEMETA COMPOSITION (python-frog)] processed 50 new triples, total is now 67

Processing source #11 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/11-repostatus.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (python-frog)] processed 1 new triples, total is now 67

Processing source #12 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/10-harvest.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (python-frog)] overriding old http://schema.org/applicationCategory (Text Processing > Linguistic -> https://vocabs.dariah.eu/tadirah/annotating)

[CODEMETA COMPOSITION (python-frog)] overriding old https://codemeta.github.io/terms/developmentStatus (https://w3id.org/research-technology-readiness-levels#Level8Complete -> https://w3id.org/research-technology-readiness-levels#Level9Proven)

[CODEMETA COMPOSITION (python-frog)] processed 23 new triples, total is now 87

Processing source #13 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-repostatus.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (python-frog)] processed 1 new triples, total is now 87

Processing source #14 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-doi.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (python-frog)] overriding old http://schema.org/identifier (python-frog -> )

[CODEMETA COMPOSITION (python-frog)] processed 5 new triples, total is now 91

Processing source #15 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.python-frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/python-frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/python-frog)] processed 1 new triples, total is now 92

Remapping URI to (possibly) new identifier and version component: https://tools.clariah.nl/python-frog -> https://tools.clariah.nl/python-frog/0.6.10

[CODEMETA VALIDATION (python-frog)] done

[CODEMETA ENRICHMENT (python-frog)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (python-frog)] adding author https://tools.clariah.nl/person/maarten-van-gompel as contributor

VALIDATION https://tools.clariah.nl/python-frog/0.6.10 #1: Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/python-frog/0.6.10 #2: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)

-- end log --

[harvester info] Output written to /tmp/out/python-frog.codemeta.json

[harvester info] <-- Finished processing python-frog (https://github.com/proycon/python-frog) [Sat Jun 15 03:15:33 UTC 2024]

        

Metadata Properties

Version
0.6.10 (release notes)
Interface types
  • Unknown
Software website
Source code repository
 https://github.com/proycon/python-frog  Stars are an indicator of the popularity of this project on GitHub
Category
  • Annotating
  • Contextualizing
  • Linguistics
  • Named Entity Recognition
  • POS-Tagging
  • Segmenting
  • Tagging
  • Textual and content analysis
  • Tree-Tagging
Keywords
  • nlp computational_linguistics dutch pos lemmatizer
Development Status
  • 9 - Proven: Technology complete and proven in practice by real users.
  • Active: The project has reached a stable, usable state and is being actively developed.
Issue Tracker (Support)
https://github.com/proycon/python-frog/issues  The number of open issues on the issue tracker  The number of closes issues on the issue tracker
Documentation
License
Author(s)
Maintainer(s)
Contributor(s)
Producer
Programming Language
  • Python
Continuous Integration Tests
https://github.com/proycon/python-frog/actions/workflows/python-frog.yml
Runtime Platform
  • Cython
  • Python 3
Operating System
  • BSD
  • Linux
  • macOS
Software dependencies
  • Cython
Metadata validation
★ ★ ★ ★ ☆
Created
2014-09-07 22:33:37 +0200
Last modified
2023-12-05 15:47:42 +0100  Last commit (main branch). Gives an indication of project development activity and rough indication of how up-to-date the latest release is.  Number of commits since the last release. Gives an indication of project development activity and rough indication of how up-to-date the latest release is.