analiticcl

Analiticcl is an approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation

Citation

You can cite this software using the following citation generated from its metadata:

(2024) analiticcl 0.4.7 .
  • KNAW Humanities Cluster & CLST, Radboud University
.

Logs & Reviews

Name
Automatic software metadata validation report for analiticcl 0.4.7
Author
  • codemetapy validator using software.ttl
Date
2024-12-09 03:00:45
Review
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems

Validation of analiticcl 0.4.7 was successful (score=3/5), but there are some warnings which should be addressed:

1. Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (This is missing in the metadata)
2. Warning: Documentation *SHOULD* be expressed (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)
3. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
4. Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)
5. Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)
Rating
★ ★ ★ ☆ ☆
(log file starts at Mon Dec  9 03:00:27 UTC 2024)

[harvester info] --> Processing analiticcl (https://github.com/proycon/analiticcl) [Mon Dec  9 03:00:27 UTC 2024]

[harvester info] Git updating cached clone of https://github.com/proycon/analiticcl...

[harvester info] Found release v0.4.7

[harvester info] Using 'v0.4.7'

[harvester info] Git reference: v0.4.7

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/analiticcl for harvestable resources...

[harvester info] found Cargo.toml (rust) for analiticcl, converting to codemeta

[harvester info] Looking for license....

[harvester info] Found license GPL-3.0-only

[harvester info] Parsing MAINTAINERS...

[harvester info] Getting contributors from git...

[harvester info] No git contributors found

[harvester info] Getting top contributor from git...

[harvester info] Git top contributor  will be assigned as author (and maintainer) if none are found in the metadata

[harvester info] Extracting last and first commit date from git log....

[harvester info] Date created: 2021-04-13T18:30:22Z+0200, date modified: 2024-10-16T11:59:28Z+0200

[harvester info] Querying Github/GitLab API (https://github.com/proycon/analiticcl)

[harvester info] Adding URL for found README: README.md

[harvester info] Found releaseNotes

[harvester info] Querying Zenodo API for DOI (access token provided)...

[harvester info] Found DOI https://doi.org/10.5281/zenodo.13939174

[harvester info] Looking for TRL information in README.md...

[harvester info] Looking for repostatus information in README.md...

[harvester info] Found repostatus https://www.repostatus.org/#active

[harvester info] Looking for continuous integration information in README.md...

[harvester info] Found CI https://github.com/proycon/analiticcl/actions/workflows/analiticcl.yml

[harvester info] Looking for documentation links in README.md...

[harvester info] Scraping title from https://docs.rs/analiticcl/

[harvester info] Found documentation at https://docs.rs/analiticcl/ : "name": "analiticcl - Rust",

[harvester info] Falling back to git tag (v0.4.7) if no version number is specified...

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#active

[harvester info] Looking for repostatus information in README.md in master branch...

[harvester info] Found repostatus (master branch) https://www.repostatus.org/#active

[harvester info] Parsing MAINTAINERS from master branch...

[harvester info] Reconciliating: codemetapy  --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "analiticcl" --codeRepository "https://github.com/proycon/analiticcl" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-version.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/50-documentation.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/43-releasenotes.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/30-maintainers.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/23-rust.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/12-ci.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/11-repostatus.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-repostatus.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-maintainers.analiticcl.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-doi.analiticcl.codemeta.json 

-- begin log --

Passed 16 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-version.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/99-repostatus.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/50-documentation.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/43-releasenotes.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/30-maintainers.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/23-rust.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/12-ci.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/11-repostatus.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-repostatus.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-maintainers.analiticcl.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-doi.analiticcl.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://tools.clariah.nl/analiticcl

Processing source #1 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-version.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 1 new triples, total is now 2

Processing source #2 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 1 new triples, total is now 3

Processing source #3 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.analiticcl.codemeta.json

    Found main resource with URI https://tools.clariah.nl/analiticcl.topcontributor/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 1 new triples, total is now 3

Processing source #4 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/50-documentation.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 4 new triples, total is now 7

Processing source #5 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/43-releasenotes.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 2 new triples, total is now 9

Processing source #6 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 1 new triples, total is now 10

Processing source #7 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.analiticcl.codemeta.json

    Found main resource with URI https://tools.clariah.nl/analiticcl/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 26 new triples, total is now 35

Processing source #8 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/dateCreated (2021-04-19T21:14:09Z -> 2021-04-13T18:30:22Z+0200)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/dateModified (2024-10-17T18:44:21Z -> 2024-10-16T11:59:28Z+0200)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 2 new triples, total is now 35

Processing source #9 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/30-maintainers.analiticcl.codemeta.json

    Found main resource with URI https://tools.clariah.nl/maintainers/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 6 new triples, total is now 35

Processing source #10 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> GPL-3.0-only)

[CODEMETA CORRECTION (https://tools.clariah.nl/analiticcl)] automatically converting license to spdx URI

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 1 new triples, total is now 35

Processing source #11 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/23-rust.analiticcl.codemeta.json

    Found main resource with URI https://tools.clariah.nl/cargo.toml/0.4.7

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/author (https://tools.clariah.nl/stub/H387908bcb4f0e621 -> https://tools.clariah.nl/stub/H76192cc942f5261a)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/description (an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction -> Analiticcl is an approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/keywords (normalization -> linguistics)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/keywords (fuzzy-matching -> linguistics)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/keywords (approximate-string-matching -> linguistics)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> http://spdx.org/licenses/GPL-3.0-or-later)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old https://codemeta.github.io/terms/readme (https://github.com/proycon/analiticcl/blob/v0.4.7//README.md -> https://tools.clariah.nl/README.md)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/softwareHelp (https://docs.rs/analiticcl/ -> https://docs.rs/analiticcl)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] overriding old http://schema.org/version (v0.4.7 -> 0.4.7)

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 63 new triples, total is now 80

Processing source #12 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/12-ci.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 1 new triples, total is now 81

Processing source #13 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/11-repostatus.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 1 new triples, total is now 81

Processing source #14 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-repostatus.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 1 new triples, total is now 81

Processing source #15 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-maintainers.analiticcl.codemeta.json

    Found main resource with URI https://tools.clariah.nl/maintainers/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 6 new triples, total is now 81

Processing source #16 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-doi.analiticcl.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/analiticcl

[CODEMETA COMPOSITION (https://tools.clariah.nl/analiticcl)] processed 5 new triples, total is now 86

Remapping URI to (possibly) new identifier and version component: https://tools.clariah.nl/analiticcl -> https://tools.clariah.nl/analiticcl/0.4.7

[CODEMETA VALIDATION (analiticcl)] done

[CODEMETA ENRICHMENT (analiticcl)] adding author https://tools.clariah.nl/person/maarten-van-gompel as contributor

[CODEMETA ENRICHMENT (analiticcl)] adding affiliation(s) of first author as producer

VALIDATION https://tools.clariah.nl/analiticcl/0.4.7 #1: Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/analiticcl/0.4.7 #2: Warning: Documentation *SHOULD* be expressed (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)

VALIDATION https://tools.clariah.nl/analiticcl/0.4.7 #3: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/analiticcl/0.4.7 #4: Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/analiticcl/0.4.7 #5: Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)

-- end log --

[harvester info] Output written to /tmp/out/analiticcl.codemeta.json

[harvester info] <-- Finished processing analiticcl (https://github.com/proycon/analiticcl) [Mon Dec  9 03:00:45 UTC 2024]

        

Metadata Properties

Version
0.4.7 (release notes)
Interface types
  • Unknown
Software website
Source code repository
 https://github.com/proycon/analiticcl  Stars are an indicator of the popularity of this project on GitHub
Keywords
  • linguistics
  • nlp
  • spellcheck
  • spelling-correction
  • text-processing
Development Status
  • Active: The project has reached a stable, usable state and is being actively developed.
Issue Tracker (Support)
https://github.com/proycon/analiticcl/issues  The number of open issues on the issue tracker  The number of closes issues on the issue tracker
Documentation
License
Author(s)
Maintainer(s)
Contributor(s)
Producer
  •   KNAW Humanities Cluster & CLST, Radboud University
Programming Language
  • Rust
Continuous Integration Tests
https://github.com/proycon/analiticcl/actions/workflows/analiticcl.yml
Software dependencies
  • bitflags
  • clap
  • ibig
  • num-traits
  • rayon
  • rustfst
  • sesdiff
  • simple-error
Metadata validation
★ ★ ★ ☆ ☆
Created
2021-04-13 18:30:22 +0200
Last modified
2024-10-16 11:59:28 +0200  Last commit (main branch). Gives an indication of project development activity and rough indication of how up-to-date the latest release is.  Number of commits since the last release. Gives an indication of project development activity and rough indication of how up-to-date the latest release is.