Frog

Frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. It performs automatic linguistic enrichment such as part of speech tagging, lemmatisation, named entity recognition, shallow parsing, dependency parsing and morphological analysis. All NLP modules are based on TiMBL.

Provided tools & services

frog

Command-line interface to the full NLP suite
Type
  • Command-line Application
Executable name
frog
Input data
Type
TextDigitalDocument
Encoding Format
application/folia+xml
Language
Dutch
Type
TextDigitalDocument
Encoding Format
text/plain
Language
Dutch
Output data
Type
TextDigitalDocument
Encoding Format
text/plain
Language
Dutch
Type
TextDigitalDocument
Encoding Format
application/folia+xml
Language
Dutch

libfrog

Frog Library with API for C++
Type
  • Software Library
Executable name
libfrog

mblem

Memory-based Lemmatiser (standalone)
Type
  • Command-line Application
Executable name
mblem
Input data
Type
TextDigitalDocument
Encoding Format
text/plain
Language
Dutch
Output data
Type
TextDigitalDocument
Encoding Format
text/plain
Language
Dutch

mbma

Memory-based Morphological Analysis (standalone)
Type
  • Command-line Application
Executable name
mbma
Input data
Type
TextDigitalDocument
Encoding Format
text/plain
Language
Dutch
Output data
Type
TextDigitalDocument
Encoding Format
text/plain
Language
Dutch

ner

Named Entity Recogniser (standalone)
Type
  • Command-line Application
Executable name
ner
Input data
Type
TextDigitalDocument
Encoding Format
text/plain
Language
Dutch
Output data
Type
TextDigitalDocument
Encoding Format
text/plain
Language
Dutch

Tool suite: Frog

The following closely related tools are in a tool suite together with Frog:

  • Web Application
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Active: The project has reached a stable, usable state and is being actively developed.

Frog-Webservice 2.7

Frog is a suite containing a tokeniser, Part-of-Speech tagger, lemmatiser, morphological analyser, shallow parser, and dependency parser for Dutch. This is the webservice for it, for both humans and machines. [view more]
  • Annotating
  • Contextualizing
  • Linguistics
  • Named Entity Recognition
  • POS-Tagging
  • Segmenting
  • Tagging
  • Textual and content analysis
  • Tree-Tagging
  • clam webservice rest nlp computational_linguistics rest
  • Bsd
  • Linux
  • Macos
  • Python
Created: 2022-02-17
Modified: 2023-12-05
  • 9 - Proven: Technology complete and proven in practice by real users.
  • Active: The project has reached a stable, usable state and is being actively developed.

python-frog 0.6.10

Python binding to Frog, an NLP suite for Dutch doing part-of-speech tagging, lemmatisation, morphological analysis, named-entity recognition, shallow parsing, and dependency parsing. [view more]
  • Annotating
  • Contextualizing
  • Linguistics
  • Named Entity Recognition
  • POS-Tagging
  • Segmenting
  • Tagging
  • Textual and content analysis
  • Tree-Tagging
  • nlp computational_linguistics dutch pos lemmatizer
  • Bsd
  • Cython
  • Linux
  • Macos
  • Python
Created: 2014-09-07
Modified: 2023-12-05
  • Active: The project has reached a stable, usable state and is being actively developed.

toad v0.8

Toad: Trainer Of All Data, the Frog training collection [view more]
Created: 2015-12-08
Modified: 2023-02-22

References

Citation

Please use one of the above reference publications to cite the software, if you want to cite the software directly, you can use the following citation generated from the metadata:

Logs & Reviews

Name
Automatic software metadata validation report for Frog 0.33
Author
  • codemetapy validator using software.ttl
Date
2024-09-15 03:07:39
Review
Validates perfectly, no further remarks!
Rating
★ ★ ★ ★ ★
(log file starts at Sun Sep 15 03:07:33 UTC 2024)

[harvester info] --> Processing frog (https://github.com/LanguageMachines/frog) [Sun Sep 15 03:07:33 UTC 2024]

[harvester info] Git updating cached clone of https://github.com/LanguageMachines/frog...

[harvester info] Found release v0.33

[harvester info] Using 'v0.33'

[harvester info] Git reference: v0.33

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/frog for harvestable resources...

[harvester info] found codemeta.json for frog (md5sum 9c6c0ae3c4395e520f8034ea7057baa8); **NOTE: this is considered authoritative and most other detection methods will be skipped now!**

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#active

[harvester info] Looking for repostatus information in README.md in master branch...

[harvester info] Found repostatus (master branch) https://www.repostatus.org/#active

[harvester info] Looking for repostatus information in README in master branch...

[harvester info] Parsing MAINTAINERS from master branch...

[harvester info] Setting group Frog

[harvester info] Reconciliating: codemetapy  --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "frog" --codeRepository "https://github.com/LanguageMachines/frog" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/10-jsonld.frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-repostatus.frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-maintainers.frog.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.frog.codemeta.json 

-- begin log --

Passed 5 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-repostatus.frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/10-jsonld.frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-repostatus.frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-maintainers.frog.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.frog.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://tools.clariah.nl/frog

Processing source #1 of 5

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/frog

[CODEMETA COMPOSITION (https://tools.clariah.nl/frog)] processed 1 new triples, total is now 2

Processing source #2 of 5

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/10-jsonld.frog.codemeta.json

    Injected (possibly temporary) URI https://tools.clariah.nl/frog

[CODEMETA CORRECTION (frog)] automatically converting spdx license URI from https:// to http:///

[CODEMETA COMPOSITION (frog)] processed 236 new triples, total is now 236

Processing source #3 of 5

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-repostatus.frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/frog

[CODEMETA COMPOSITION (frog)] processed 1 new triples, total is now 236

Processing source #4 of 5

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-maintainers.frog.codemeta.json

    Found main resource with URI https://tools.clariah.nl/maintainers/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/frog

[CODEMETA COMPOSITION (frog)] processed 14 new triples, total is now 249

Processing source #5 of 5

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.frog.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/frog

[CODEMETA COMPOSITION (frog)] processed 1 new triples, total is now 250

Remapping URI to (possibly) new identifier and version component: https://tools.clariah.nl/frog -> https://tools.clariah.nl/frog/0.33

[CODEMETA VALIDATION (frog)] done

[CODEMETA ENRICHMENT (frog)] adding author https://tools.clariah.nl/stub/H-13888776753f9ff5 as contributor

[CODEMETA ENRICHMENT (frog)] adding author https://orcid.org/0000-0003-2493-656X as contributor

[CODEMETA ENRICHMENT (frog)] adding author https://orcid.org/0000-0002-1046-0006 as contributor

-- end log --

[harvester info] Output written to /tmp/out/frog.codemeta.json

[harvester info] <-- Finished processing frog (https://github.com/LanguageMachines/frog) [Sun Sep 15 03:07:39 UTC 2024]

        

Metadata Properties

Version
0.33 (release notes)
Interface types
  • Command-line Application
  • Software Library
Software website
Source code repository
 https://github.com/LanguageMachines/frog  Stars are an indicator of the popularity of this project on GitHub
Category
  • Annotating
  • Contextualizing
  • Linguistics
  • Named Entity Recognition
  • POS-Tagging
  • Segmenting
  • Tagging
  • Textual and content analysis
  • Tree-Tagging
Keywords
  • dependency parsing
  • dutch
  • lemma
  • lemmatisation
  • natural language processing
  • ner
  • nlp
  • parser
  • part-of-speech tagging
  • pos
  • shallow parsing
  • tagger
Development Status
  • 9 - Proven: Technology complete and proven in practice by real users.
  • Active: The project has reached a stable, usable state and is being actively developed.
Issue Tracker (Support)
https://github.com/LanguageMachines/frog/issues  The number of open issues on the issue tracker  The number of closes issues on the issue tracker
Documentation
License
Author(s)
Maintainer(s)
Contributor(s)
Producer
Programming Language
  • C++
Continuous Integration Tests
https://github.com/LanguageMachines/frog/actions/workflows/frog.yml
Operating System
  • BSD
  • Linux
  • macOS
Software dependencies
  • libxml2
  • libfolia
  • ucto
  • mbt
  • timbl
  • icu
  • ticcutils
Metadata validation
★ ★ ★ ★ ★
Created
2011-03-31 12:35:01 +0000
Last modified
2023-12-05 15:43:06 +0100  Last commit (main branch). Gives an indication of project development activity and rough indication of how up-to-date the latest release is.  Number of commits since the last release. Gives an indication of project development activity and rough indication of how up-to-date the latest release is.