The following closely related tools are in a tool suite together with BlackLab Corpus Search:
You can cite this software using the following citation generated from its metadata:
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems Validation of BlackLab Corpus Search 3.0.1 was successful (score=3/5), but there are some warnings which should be addressed: 1. Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata) 2. Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (This is missing in the metadata) 3. Warning: Documentation *SHOULD* be expressed (This is missing in the metadata) 4. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata) 5. Info: The funder *SHOULD* be acknowledged (This is missing in the metadata) 6. Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)
(log file starts at Mon Dec 9 03:02:08 UTC 2024) [harvester info] --> Processing blacklab (https://github.com/INL/BlackLab) [Mon Dec 9 03:02:08 UTC 2024] [harvester info] Git updating cached clone of https://github.com/INL/BlackLab... [harvester info] Found release v3.0.1 [harvester info] Using 'v3.0.1' [harvester info] Git reference: v3.0.1 [harvester info] Scanning directory /tmp/codemeta-harvester.cache/blacklab for harvestable resources... [harvester info] found pom.xml (Java/Maven) for blacklab, converting to codemeta [harvester info] Looking for license.... [harvester info] Found license Apache-2.0 [harvester info] Getting contributors from git... [harvester info] No git contributors found [harvester info] Getting top contributor from git... [harvester info] Git top contributor will be assigned as author (and maintainer) if none are found in the metadata [harvester info] Extracting last and first commit date from git log.... [harvester info] Date created: 2012-10-04T03:59:43Z-0700, date modified: 2022-10-06T13:08:42Z+0200 [harvester info] Querying Github/GitLab API (https://github.com/INL/BlackLab) [harvester info] Adding URL for found README: README.md [harvester info] Found releaseNotes [harvester info] Querying Zenodo API for DOI (access token provided)... [harvester info] Looking for TRL information in README.md... [harvester info] Looking for repostatus information in README.md... [harvester info] Looking for continuous integration information in README.md... [harvester info] Looking for documentation links in README.md... [harvester info] Falling back to git tag (v3.0.1) if no version number is specified... [harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)... [harvester info] Inferred repostatus https://www.repostatus.org/#active [harvester info] Looking for repostatus information in README.md in master branch... [harvester info] Setting group Blacklab & Corpus Search [harvester info] Reconciliating: codemetapy --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "blacklab" --codeRepository "https://github.com/INL/BlackLab" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/blacklab.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-version.blacklab.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.blacklab.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.blacklab.codemeta.json /tmp/codemeta-harvester.cache//tmp/43-releasenotes.blacklab.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.blacklab.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.blacklab.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.blacklab.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.blacklab.codemeta.json /tmp/codemeta-harvester.cache//tmp/21-java.blacklab.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.blacklab.codemeta.json -- begin log -- Passed 10 files/sources but specified 0 input types! Automatically guessing types... Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-version.blacklab.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/99-repostatus.blacklab.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.blacklab.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/43-releasenotes.blacklab.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.blacklab.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.blacklab.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.blacklab.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.blacklab.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/21-java.blacklab.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.blacklab.codemeta.json', 'json')] Adding to contextgraph: /tmp/turtle Initial URI automatically generated, may be overriden later: https://tools.clariah.nl/blacklab Processing source #1 of 10 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-version.blacklab.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://tools.clariah.nl/blacklab [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] processed 1 new triples, total is now 2 Processing source #2 of 10 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.blacklab.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://tools.clariah.nl/blacklab [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] processed 1 new triples, total is now 3 Processing source #3 of 10 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.blacklab.codemeta.json Found main resource with URI https://tools.clariah.nl/blacklab.topcontributor/snapshot Injected (possibly temporary) URI https://tools.clariah.nl/blacklab [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] processed 1 new triples, total is now 3 Processing source #4 of 10 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/43-releasenotes.blacklab.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://tools.clariah.nl/blacklab [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] processed 2 new triples, total is now 5 Processing source #5 of 10 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.blacklab.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://tools.clariah.nl/blacklab [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] processed 1 new triples, total is now 6 Processing source #6 of 10 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.blacklab.codemeta.json Found main resource with URI https://tools.clariah.nl/blacklab/snapshot Injected (possibly temporary) URI https://tools.clariah.nl/blacklab [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] processed 15 new triples, total is now 20 Processing source #7 of 10 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.blacklab.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://tools.clariah.nl/blacklab [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] overriding old http://schema.org/dateCreated (2012-10-04T10:59:42Z -> 2012-10-04T03:59:43Z-0700) [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] overriding old http://schema.org/dateModified (2024-12-05T09:14:26Z -> 2022-10-06T13:08:42Z+0200) [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] processed 2 new triples, total is now 20 Processing source #8 of 10 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.blacklab.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://tools.clariah.nl/blacklab [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] overriding old http://schema.org/license (http://spdx.org/licenses/Apache-2.0 -> Apache-2.0) [CODEMETA CORRECTION (https://tools.clariah.nl/blacklab)] automatically converting license to spdx URI [CODEMETA COMPOSITION (https://tools.clariah.nl/blacklab)] processed 1 new triples, total is now 20 Processing source #9 of 10 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/21-java.blacklab.codemeta.json Found main resource with URI https://tools.clariah.nl/nl.inl.blacklab.blacklab-all/3.0.1 Injected (possibly temporary) URI https://tools.clariah.nl/blacklab [CODEMETA COMPOSITION (nl.inl.blacklab.blacklab-all)] overriding old http://schema.org/description (Linguistic search for large annotated text corpora, based on Apache Lucene -> The parent project for BlackLab Core and BlackLab Server.) [CODEMETA COMPOSITION (nl.inl.blacklab.blacklab-all)] overriding old http://schema.org/name (BlackLab -> BlackLab Corpus Search) [CODEMETA COMPOSITION (nl.inl.blacklab.blacklab-all)] overriding old http://schema.org/producer (https://tools.clariah.nl/org/dutch-language-institute -> https://tools.clariah.nl/org/instituut-voor-nederlandse-taal-int) [CODEMETA COMPOSITION (nl.inl.blacklab.blacklab-all)] overriding old http://schema.org/version (v3.0.1 -> 3.0.1) [CODEMETA COMPOSITION (nl.inl.blacklab.blacklab-all)] processed 82 new triples, total is now 93 Processing source #10 of 10 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.blacklab.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://tools.clariah.nl/blacklab [CODEMETA COMPOSITION (nl.inl.blacklab.blacklab-all)] processed 1 new triples, total is now 94 Remapping URI to (possibly) new identifier and version component: https://tools.clariah.nl/blacklab -> https://tools.clariah.nl/blacklab/3.0.1 [CODEMETA VALIDATION (blacklab)] done [CODEMETA ENRICHMENT (blacklab)] considering first author as maintainer VALIDATION https://tools.clariah.nl/blacklab/3.0.1 #1: Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata) VALIDATION https://tools.clariah.nl/blacklab/3.0.1 #2: Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (This is missing in the metadata) VALIDATION https://tools.clariah.nl/blacklab/3.0.1 #3: Warning: Documentation *SHOULD* be expressed (This is missing in the metadata) VALIDATION https://tools.clariah.nl/blacklab/3.0.1 #4: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata) VALIDATION https://tools.clariah.nl/blacklab/3.0.1 #5: Info: The funder *SHOULD* be acknowledged (This is missing in the metadata) VALIDATION https://tools.clariah.nl/blacklab/3.0.1 #6: Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata) -- end log -- [harvester info] Output written to /tmp/out/blacklab.codemeta.json [harvester info] <-- Finished processing blacklab (https://github.com/INL/BlackLab) [Mon Dec 9 03:02:21 UTC 2024]