The following closely related tools are in a tool suite together with textrepo:
You can cite this software using the following citation generated from its metadata:
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems Validation of textrepo v1.19.0 failed (score 2/5) due to one or more requirement violations: 1. Violation: The maintainer of the software source code *MUST* be expressed. (This is missing in the metadata) 2. Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata) 3. Info: Software source code *MAY* express the programming language(s) used (This is missing in the metadata) 4. Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (This is missing in the metadata) 5. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata) 6. Info: The funder *SHOULD* be acknowledged (This is missing in the metadata) 7. Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)
(log file starts at Sat Feb 7 03:31:40 UTC 2026)
[harvester info] --> Processing textrepo (https://github.com/knaw-huc/textrepo) [Sat Feb 7 03:31:40 UTC 2026]
[harvester info] Git updating cached clone of https://github.com/knaw-huc/textrepo...
[harvester info] Found release v1.19.0
[harvester info] Using 'v1.19.0'
[harvester info] Git reference: v1.19.0
[harvester info] Scanning directory /tmp/codemeta-harvester.cache/textrepo for harvestable resources...
[harvester info] Looking for license....
[harvester info] Found license Apache-2.0
[harvester info] Getting contributors from git...
[harvester info] Getting top contributor from git...
[harvester info] Git top contributor HDJ <hayco@users.noreply.github.com> will be assigned as author (and maintainer) if none are found in the metadata
[harvester info] Extracting last and first commit date from git log....
[harvester info] Date created: 2019-08-07T17:26:01Z+0200, date modified: 2022-03-15T14:51:17Z+0100
[harvester info] Querying Github/GitLab API (https://github.com/knaw-huc/textrepo)
[harvester info] Adding URL for found README: README.md
[harvester info] Found releaseNotes
[harvester info] Querying Zenodo API for DOI (access token provided)...
[harvester info] Looking for TRL information in README.md...
[harvester info] Looking for repostatus information in README.md...
[harvester info] Looking for continuous integration information in README.md...
[harvester info] Looking for documentation links in README.md...
[harvester info] Scraping title from http://textrepo.readthedocs.io/en/latest/
[harvester info] Found documentation at http://textrepo.readthedocs.io/en/latest/ : "name": "Text Repository — Text Repository documentation",
[harvester info] Falling back to git tag (v1.19.0) if no version number is specified...
[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...
[harvester info] Inferred repostatus https://www.repostatus.org/#inactive
[harvester info] Looking for repostatus information in README.md in master branch...
[harvester info] Found repostatus (master branch) https://www.repostatus.org/#active
[harvester info] Setting group TextRepo
[harvester info] Reconciliating: codemetapy --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "textrepo" --codeRepository "https://github.com/knaw-huc/textrepo" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-version.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/50-documentation.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/43-releasenotes.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/32-contributors.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-repostatus.textrepo.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.textrepo.codemeta.json
-- begin log --
/usr/lib/python3.12/site-packages/pyshacl/extras/__init__.py:6: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
import pkg_resources
Passed 12 files/sources but specified 0 input types! Automatically guessing types...
Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-version.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/99-repostatus.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/50-documentation.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/43-releasenotes.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/32-contributors.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-repostatus.textrepo.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.textrepo.codemeta.json', 'json')]
Adding to contextgraph: /tmp/turtle
Initial URI automatically generated, may be overriden later: https://tools.clariah.nl/textrepo
Processing source #1 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-version.textrepo.codemeta.json
NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 1 new triples, total is now 2
Processing source #2 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.textrepo.codemeta.json
NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 1 new triples, total is now 3
Processing source #3 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.textrepo.codemeta.json
Found main resource with URI https://tools.clariah.nl/textrepo.topcontributor/snapshot
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 8 new triples, total is now 10
Processing source #4 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/50-documentation.textrepo.codemeta.json
NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 4 new triples, total is now 14
Processing source #5 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/43-releasenotes.textrepo.codemeta.json
NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 2 new triples, total is now 16
Processing source #6 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.textrepo.codemeta.json
NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 1 new triples, total is now 17
Processing source #7 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.textrepo.codemeta.json
Found main resource with URI https://tools.clariah.nl/textrepo/snapshot
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 13 new triples, total is now 29
Processing source #8 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.textrepo.codemeta.json
NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] overriding old http://schema.org/dateCreated (2019-08-07T15:28:33Z -> 2019-08-07T17:26:01Z+0200)
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] overriding old http://schema.org/dateModified (2026-02-03T14:41:49Z -> 2022-03-15T14:51:17Z+0100)
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 2 new triples, total is now 29
Processing source #9 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/32-contributors.textrepo.codemeta.json
Found main resource with URI https://tools.clariah.nl/textrepo.contributors/snapshot
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 38 new triples, total is now 62
Processing source #10 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.textrepo.codemeta.json
NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] overriding old http://schema.org/license (http://spdx.org/licenses/Apache-2.0 -> Apache-2.0)
[CODEMETA CORRECTION (https://tools.clariah.nl/textrepo)] automatically converting license to spdx URI
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 1 new triples, total is now 62
Processing source #11 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-repostatus.textrepo.codemeta.json
NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] overriding old https://codemeta.github.io/terms/developmentStatus (https://www.repostatus.org/#inactive -> https://www.repostatus.org/#active)
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 1 new triples, total is now 62
Processing source #12 of 12
Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.textrepo.codemeta.json
NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...
Injected (possibly temporary) URI https://tools.clariah.nl/textrepo
[CODEMETA COMPOSITION (https://tools.clariah.nl/textrepo)] processed 1 new triples, total is now 63
Remapping URI to (possibly) new identifier and version component: https://tools.clariah.nl/textrepo -> https://tools.clariah.nl/textrepo/v1.19.0
[CODEMETA VALIDATION (textrepo)] done
[CODEMETA ENRICHMENT (textrepo)] considering first author as maintainer
VALIDATION https://tools.clariah.nl/textrepo/v1.19.0 #1: Violation: The maintainer of the software source code *MUST* be expressed. (This is missing in the metadata)
VALIDATION https://tools.clariah.nl/textrepo/v1.19.0 #2: Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)
VALIDATION https://tools.clariah.nl/textrepo/v1.19.0 #3: Info: Software source code *MAY* express the programming language(s) used (This is missing in the metadata)
VALIDATION https://tools.clariah.nl/textrepo/v1.19.0 #4: Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (This is missing in the metadata)
VALIDATION https://tools.clariah.nl/textrepo/v1.19.0 #5: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
VALIDATION https://tools.clariah.nl/textrepo/v1.19.0 #6: Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)
VALIDATION https://tools.clariah.nl/textrepo/v1.19.0 #7: Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)
-- end log --
[harvester info] Output written to /tmp/out/textrepo.codemeta.json
[harvester info] <-- Finished processing textrepo (https://github.com/knaw-huc/textrepo) [Sat Feb 7 03:31:57 UTC 2026]