stam-tools

Command-line tools for working with stand-off annotations on text (STAM)

Provided tools & services

stam-tools

Type
  • Command-line Application

Tool suite: STAM

The following closely related tools are in a tool suite together with stam-tools:

  • Experimental: The technology is implemented and ready for experimental settings (beta), but requires further work and validation.
  • Active: The project has reached a stable, usable state and is being actively developed.
thumbnail/logo

stam v1.1.1

Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an annotation. This repository contains the model's full specification, extensions, schemas, examples and documentation. [view more]
  • Annotating
  • Textual and content analysis
  • Textual and linguistic corpora
  • annotation
  • linguistics
  • stand-off
  • text
  • text-annotation
  • webannotation
Created: 2021-09-09
Modified: 2024-09-17
  • Software Library
  • 7 - Release Candidate: Technology ready enough and in initial use by end-users in intended scholarly environments. Further validation may be in progress.
  • Active: The project has reached a stable, usable state and is being actively developed.
thumbnail/logo

stam 0.10.1

STAM is a library for dealing with standoff annotations on text, this is the python binding. [view more]
  • Annotating
  • Textual and content analysis
  • Textual and linguistic corpora
  • annotation
  • linguistics
  • nlp
  • standoff
  • text-processing
Created: 2023-01-31
Modified: 2024-10-18
  • Software Library
  • 7 - Release Candidate: Technology ready enough and in initial use by end-users in intended scholarly environments. Further validation may be in progress.
  • Active: The project has reached a stable, usable state and is being actively developed.
thumbnail/logo

stam 0.16.5

STAM is a powerful library for dealing with stand-off annotations on text. This is the Rust library. [view more]
  • Annotating
  • Textual and content analysis
  • Textual and linguistic corpora
  • annotation
  • linguistics
  • nlp
  • standoff
  • text-processing
Created: 2023-01-03
Modified: 2024-11-18

Citation

You can cite this software using the following citation generated from its metadata:

Logs & Reviews

Name
Automatic software metadata validation report for stam-tools 0.9.2
Author
  • codemetapy validator using software.ttl
Date
2024-12-09 03:15:48
Review
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems

Validation of stam-tools 0.9.2 was successful (score=3/5), but there are some warnings which should be addressed:

1. Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)
2. Warning: Documentation *SHOULD* be expressed (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)
3. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
Rating
★ ★ ★ ☆ ☆
(log file starts at Mon Dec  9 03:15:36 UTC 2024)

[harvester info] --> Processing stam-tools (https://github.com/annotation/stam-tools) [Mon Dec  9 03:15:36 UTC 2024]

[harvester info] Git updating cached clone of https://github.com/annotation/stam-tools...

[harvester info] Found release v0.9.2

[harvester info] Using 'v0.9.2'

[harvester info] Git reference: v0.9.2

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/stam-tools for harvestable resources...

[harvester info] found codemeta-harvest.json for stam-tools (md5sum 2b827fccacf74d2f577ceb5c4ab5afe0); values in here take precendence over (override) those in later detection stages

[harvester info] found Cargo.toml (rust) for stam-tools, converting to codemeta

[harvester info] Looking for license....

[harvester info] Found license GPL-3.0-only

[harvester info] Getting contributors from git...

[harvester info] No git contributors found

[harvester info] Getting top contributor from git...

[harvester info] Git top contributor  will be assigned as author (and maintainer) if none are found in the metadata

[harvester info] Extracting last and first commit date from git log....

[harvester info] Date created: 2023-03-21T15:44:21Z+0100, date modified: 2024-11-18T16:51:34Z+0100

[harvester info] Querying Github/GitLab API (https://github.com/annotation/stam-tools)

[harvester info] Adding URL for found README: README.md

[harvester info] Found releaseNotes

[harvester info] Querying Zenodo API for DOI (access token provided)...

[harvester info] Found DOI https://doi.org/10.5281/zenodo.14181350

[harvester info] Looking for TRL information in README.md...

[harvester info] Found TRL https://w3id.org/research-technology-readiness-levels#Level7ReleaseCandidate

[harvester info] Looking for repostatus information in README.md...

[harvester info] Found repostatus https://www.repostatus.org/#active

[harvester info] Looking for continuous integration information in README.md...

[harvester info] Looking for documentation links in README.md...

[harvester info] Scraping title from https://docs.rs/regex/latest/regex/#syntax

[harvester info] Found documentation at https://docs.rs/regex/latest/regex/#syntax : "name": "regex - Rust",

[harvester info] Scraping title from https://docs.rs/stam-tools/

[harvester info] Found documentation at https://docs.rs/stam-tools/ : "name": "stamtools - Rust",

[harvester info] Falling back to git tag (v0.9.2) if no version number is specified...

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#active

[harvester info] Looking for repostatus information in README.md in master branch...

[harvester info] Found repostatus (master branch) https://www.repostatus.org/#active

[harvester info] Setting group STAM

[harvester info] Reconciliating: codemetapy  --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "stam-tools" --codeRepository "https://github.com/annotation/stam-tools" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-version.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/50-documentation.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/43-releasenotes.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/23-rust.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/11-trl.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/11-repostatus.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/10-harvest.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-repostatus.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-doi.stam-tools.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.stam-tools.codemeta.json 

-- begin log --

Passed 16 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-version.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/99-repostatus.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/50-documentation.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/43-releasenotes.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/23-rust.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/11-trl.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/11-repostatus.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/10-harvest.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-repostatus.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-doi.stam-tools.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.stam-tools.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://tools.clariah.nl/stam-tools

Processing source #1 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-version.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 1 new triples, total is now 2

Processing source #2 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 1 new triples, total is now 3

Processing source #3 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.stam-tools.codemeta.json

    Found main resource with URI https://tools.clariah.nl/stam-tools.topcontributor/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 1 new triples, total is now 3

Processing source #4 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/50-documentation.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 8 new triples, total is now 11

Processing source #5 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/43-releasenotes.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 2 new triples, total is now 13

Processing source #6 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 1 new triples, total is now 14

Processing source #7 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.stam-tools.codemeta.json

    Found main resource with URI https://tools.clariah.nl/stam-tools/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 15 new triples, total is now 28

Processing source #8 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] overriding old http://schema.org/dateCreated (2023-03-21T14:43:17Z -> 2023-03-21T15:44:21Z+0100)

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] overriding old http://schema.org/dateModified (2024-11-18T15:52:57Z -> 2024-11-18T16:51:34Z+0100)

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 2 new triples, total is now 28

Processing source #9 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> GPL-3.0-only)

[CODEMETA CORRECTION (https://tools.clariah.nl/stam-tools)] automatically converting license to spdx URI

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 1 new triples, total is now 28

Processing source #10 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/23-rust.stam-tools.codemeta.json

    Found main resource with URI https://tools.clariah.nl/cargo.toml/0.9.2

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] overriding old http://schema.org/description (Command line tools for working with standoff text annotations (STAM) -> Command-line tools for working with stand-off annotations on text (STAM))

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] overriding old https://codemeta.github.io/terms/readme (https://github.com/annotation/stam-tools/blob/v0.9.2//README.md -> https://tools.clariah.nl/README.md)

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] overriding old http://schema.org/softwareHelp (https://docs.rs/regex/latest/regex/#syntax -> https://github.com/annotation/stam-tools)

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] overriding old http://schema.org/softwareHelp (https://docs.rs/stam-tools/ -> https://github.com/annotation/stam-tools)

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] overriding old http://schema.org/version (v0.9.2 -> 0.9.2)

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 63 new triples, total is now 80

Processing source #11 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/11-trl.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 1 new triples, total is now 81

Processing source #12 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/11-repostatus.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 1 new triples, total is now 81

Processing source #13 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/10-harvest.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] overriding old http://schema.org/producer (https://tools.clariah.nl/org/annotation -> https://tools.clariah.nl/stub/H72b4051873615c1a)

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 14 new triples, total is now 94

Processing source #14 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-repostatus.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 1 new triples, total is now 94

Processing source #15 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-doi.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stam-tools)] processed 5 new triples, total is now 99

Processing source #16 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.stam-tools.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/stam-tools

[CODEMETA COMPOSITION (https://tools.clariah.nl/stub/H605a47173ea1bf87)] processed 1 new triples, total is now 100

Remapping URI to (possibly) new identifier and version component: https://tools.clariah.nl/stam-tools -> https://tools.clariah.nl/stam-tools/0.9.2

[CODEMETA VALIDATION (stam-tools)] done

[CODEMETA ENRICHMENT (stam-tools)] Guessing interface type https://w3id.org/software-types#CommandLineApplication based on clues

[CODEMETA ENRICHMENT (stam-tools)] adding author https://tools.clariah.nl/person/maarten-van-gompel as contributor

[CODEMETA ENRICHMENT (stam-tools)] considering first author as maintainer

VALIDATION https://tools.clariah.nl/stam-tools/0.9.2 #1: Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/stam-tools/0.9.2 #2: Warning: Documentation *SHOULD* be expressed (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)

VALIDATION https://tools.clariah.nl/stam-tools/0.9.2 #3: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)

-- end log --

[harvester info] Output written to /tmp/out/stam-tools.codemeta.json

[harvester info] <-- Finished processing stam-tools (https://github.com/annotation/stam-tools) [Mon Dec  9 03:15:48 UTC 2024]

        

Metadata Properties

Version
0.9.2 (release notes)
Interface types
  • Command-line Application
Software website
Source code repository
 https://github.com/annotation/stam-tools  Stars are an indicator of the popularity of this project on GitHub
Category
  • Annotating
  • Textual and content analysis
  • Textual and linguistic corpora
Keywords
  • annotation
  • linguistics
  • nlp
  • standoff
  • text-processing
Development Status
  • 7 - Release Candidate: Technology ready enough and in initial use by end-users in intended scholarly environments. Further validation may be in progress.
  • Active: The project has reached a stable, usable state and is being actively developed.
Issue Tracker (Support)
https://github.com/annotation/stam-tools/issues  The number of open issues on the issue tracker  The number of closes issues on the issue tracker
Documentation
License
Author(s)
Maintainer(s)
Contributor(s)
Producer
Programming Language
  • Rust
Software dependencies
  • atty
  • clap
  • html-escape
  • roxmltree
  • seal
  • serde
  • stam
  • toml
Metadata validation
★ ★ ★ ☆ ☆
Created
2023-03-21 15:44:21 +0100
Last modified
2024-11-18 16:51:34 +0100  Last commit (main branch). Gives an indication of project development activity and rough indication of how up-to-date the latest release is.  Number of commits since the last release. Gives an indication of project development activity and rough indication of how up-to-date the latest release is.