wandexer

Indexes an AnnoRepo container into an Elasticsearch index.

Provided tools & services

wandexer

Type
  • Command-line Application
Executable name
wandexer

Tool suite: AnnoRepo

The following closely related tools are in a tool suite together with wandexer:

  • Web API
  • Experimental: The technology is implemented and ready for experimental settings (beta), but requires further work and validation.
  • Active: The project has reached a stable, usable state and is being actively developed.

AnnoRepo 0.8.0

Implementation of W3C Web Annotation Protocol (root project)
  • web-annotation
  • web-annotation-protocol
Created: 2022-03-24
Modified: 2025-08-25
  • Command-line Application
  • Experimental: The technology is implemented and ready for experimental settings (beta), but requires further work and validation.
  • Active: The project has reached a stable, usable state and is being actively developed.

annorepo-client 0.3.2

A Python client for accessing an AnnoRepo server
  • Os
  • Python
Created: 2022-04-07
Modified: 2025-11-12
  • WIP: Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.
Created: 2021-02-25
Modified: 2026-04-20

Citation

You can cite this software using the following citation generated from its metadata:

(2026) wandexer 1.1.3. KNAW Humanities Cluster.

Logs & Reviews

Name
Automatic software metadata validation report for wandexer 1.1.3
Author
  • codemetapy validator using software.ttl
Date
2026-04-23 03:23:20
Review
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems

Validation of wandexer 1.1.3 was successful (score=3/5), but there are some warnings which should be addressed:

1. Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)
2. Warning: Documentation *SHOULD* be expressed (This is missing in the metadata)
3. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
4. Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)
5. Info: A research domain *SHOULD* be expressed as a category using the NWO Research Fields vocabulary, if applicable (This is missing in the metadata)
6. Info: A research activity *SHOULD* be expressed as a category using the TaDiRaH vocabulary (This is missing in the metadata)
Rating
★ ★ ★ ☆ ☆
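
The findings above mix two severity levels (Info and Warning). As a minimal sketch, assuming the report is available as plain text in the `N. Severity: message` layout shown above, the findings could be tallied per severity like this. The function name `tally_findings` is hypothetical and is not part of codemetapy.

```python
import re
from collections import Counter

# Matches lines such as "2. Warning: Documentation *SHOULD* be expressed (...)"
FINDING = re.compile(r"^\s*(\d+)\.\s+(\w+):\s+(.*)$")


def tally_findings(report: str) -> Counter:
    """Count validation findings per severity label (hypothetical helper)."""
    counts = Counter()
    for line in report.splitlines():
        match = FINDING.match(line)
        if match:
            counts[match.group(2)] += 1
    return counts


# Three findings copied (shortened) from the report above.
report = """\
1. Info: Software source code *SHOULD* link to a continuous integration service
2. Warning: Documentation *SHOULD* be expressed
3. Info: Reference publications *SHOULD* be expressed, if any
"""

for severity, count in sorted(tally_findings(report).items()):
    print(severity, count)
```

Run over the full report, a tally like this makes it easy to see that only one of the six findings is a Warning.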
(log file starts at Thu Apr 23 03:23:05 UTC 2026)

[harvester info] --> Processing wandexer (https://github.com/knaw-huc/wandexer) [Thu Apr 23 03:23:05 UTC 2026]

[harvester info] Git updating cached clone of https://github.com/knaw-huc/wandexer...

[harvester info] Found release v1.1.3

[harvester info] Using 'v1.1.3'

[harvester info] Git reference: v1.1.3

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/wandexer for harvestable resources...

[harvester info] found python setup for wandexer, converting to codemeta

[harvester info] Looking for license....

[harvester info] Found license GPL-3.0-only

[harvester info] Getting contributors from git...

[harvester info] Getting top contributor from git...

[harvester info] Git top contributor HDJ <hayco.de.jong@di.huc.knaw.nl> will be assigned as author (and maintainer) if none are found in the metadata

[harvester info] Extracting last and first commit date from git log....

[harvester info] Date created: 2025-01-22T15:00:10Z+0100, date modified: 2026-03-09T15:18:02Z+0100

[harvester info] Querying Github/GitLab API (https://github.com/knaw-huc/wandexer)

[harvester info] Adding URL for found README: README.md

[harvester info] Found releaseNotes

[harvester info] Querying Zenodo API for DOI (access token provided)...

[harvester info] Looking for TRL information in README.md...

[harvester info] Looking for repostatus information in README.md...

[harvester info] Looking for continuous integration information in README.md...

[harvester info] Looking for documentation links in README.md...

[harvester info] Falling back to git tag (v1.1.3) if no version number is specified...

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#active

[harvester info] Looking for repostatus information in README.md in master branch...

[harvester info] Setting group AnnoRepo

[harvester info] Reconciliating: codemetapy  --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "wandexer" --codeRepository "https://github.com/knaw-huc/wandexer" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-version.wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/43-releasenotes.wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/32-contributors.wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/20-python.wandexer.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.wandexer.codemeta.json 

-- begin log --

Passed 11 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-version.wandexer.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/99-repostatus.wandexer.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.wandexer.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/43-releasenotes.wandexer.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.wandexer.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.wandexer.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.wandexer.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/32-contributors.wandexer.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.wandexer.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/20-python.wandexer.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.wandexer.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://tools.clariah.nl/wandexer

Processing source #1 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-version.wandexer.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] processed 1 new triples, total is now 2

Processing source #2 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.wandexer.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] processed 1 new triples, total is now 3

Processing source #3 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.wandexer.codemeta.json

    Found main resource with URI https://tools.clariah.nl/wandexer.topcontributor/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] processed 8 new triples, total is now 10

Processing source #4 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/43-releasenotes.wandexer.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] processed 2 new triples, total is now 12

Processing source #5 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.wandexer.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] processed 1 new triples, total is now 13

Processing source #6 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.wandexer.codemeta.json

    Found main resource with URI https://tools.clariah.nl/wandexer/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] processed 13 new triples, total is now 25

Processing source #7 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.wandexer.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] overriding old http://schema.org/dateCreated (2025-04-15T11:05:55Z -> 2025-01-22T15:00:10Z+0100)

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] overriding old http://schema.org/dateModified (2026-04-22T09:35:38Z -> 2026-03-09T15:18:02Z+0100)

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] processed 2 new triples, total is now 25

Processing source #8 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/32-contributors.wandexer.codemeta.json

    Found main resource with URI https://tools.clariah.nl/wandexer.contributors/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] processed 26 new triples, total is now 46

Processing source #9 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.wandexer.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> GPL-3.0-only)

[CODEMETA CORRECTION (https://tools.clariah.nl/wandexer)] automatically converting license to spdx URI

[CODEMETA COMPOSITION (https://tools.clariah.nl/wandexer)] processed 1 new triples, total is now 46

Processing source #10 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/20-python.wandexer.codemeta.json

    Found main resource with URI https://tools.clariah.nl/wandexer/1.1.3

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (wandexer)] overriding old http://schema.org/author (https://tools.clariah.nl/stub/H-4407d79d624e3cb -> https://tools.clariah.nl/stub/H0d7a232a3598f5af)

[CODEMETA COMPOSITION (wandexer)] overriding old http://schema.org/codeRepository (https://github.com/knaw-huc/wandexer -> https://github.com/knaw-huc/peen-indexer)

[CODEMETA COMPOSITION (wandexer)] overriding old http://schema.org/description (Indexer for W3C Web Annotations as produced by AnnoRepo -> "index annorepo container to elastic index")

[CODEMETA COMPOSITION (wandexer)] overriding old https://codemeta.github.io/terms/developmentStatus (https://www.repostatus.org/#active -> https://www.repostatus.org/#wip)

[CODEMETA COMPOSITION (wandexer)] overriding old http://schema.org/version (v1.1.3 -> 1.1.3)

[CODEMETA COMPOSITION (wandexer)] processed 76 new triples, total is now 113

Processing source #11 of 11

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.wandexer.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/wandexer

[CODEMETA COMPOSITION (wandexer)] processed 1 new triples, total is now 114

Remapping URI to (possibly) new identifier and version component: https://tools.clariah.nl/wandexer -> https://tools.clariah.nl/wandexer/1.1.3

[CODEMETA VALIDATION (wandexer)] done

[CODEMETA ENRICHMENT (wandexer)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (wandexer)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (wandexer)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (wandexer)] considering first author as maintainer

VALIDATION https://tools.clariah.nl/wandexer/1.1.3 #1: Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/wandexer/1.1.3 #2: Warning: Documentation *SHOULD* be expressed (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/wandexer/1.1.3 #3: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/wandexer/1.1.3 #4: Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/wandexer/1.1.3 #5: Info: A research domain *SHOULD* be expressed as a category using the NWO Research Fields vocabulary, if applicable (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/wandexer/1.1.3 #6: Info: A research activity *SHOULD* be expressed as a category using the TaDiRaH vocabulary (This is missing in the metadata)

-- end log --

[harvester info] Output written to /tmp/out/wandexer.codemeta.json

[harvester info] <-- Finished processing wandexer (https://github.com/knaw-huc/wandexer) [Thu Apr 23 03:23:21 UTC 2026]
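
The composition log above shows later sources replacing values set by earlier ones. For example, source #10 changes `codeRepository` from https://github.com/knaw-huc/wandexer to https://github.com/knaw-huc/peen-indexer and `developmentStatus` from active to wip. A minimal sketch for listing such overrides follows, assuming the log is available as plain text in the format shown above. The names `Override` and `find_overrides` are hypothetical and not part of the harvester or codemetapy.

```python
import re
from typing import NamedTuple


class Override(NamedTuple):
    prop: str  # property URI whose value was replaced
    old: str   # value before the override
    new: str   # value after the override


# "[CODEMETA COMPOSITION (<subject>)] overriding old <property> (<old> -> <new>)"
OVERRIDE = re.compile(
    r"\[CODEMETA COMPOSITION \([^)]*\)\] overriding old (\S+) \((.*) -> (.*)\)\s*$"
)


def find_overrides(log: str) -> list[Override]:
    """Return every property override reported in a composition log."""
    found = []
    for line in log.splitlines():
        match = OVERRIDE.search(line)
        if match:
            found.append(Override(*match.groups()))
    return found


# Two lines copied from the log above, plus one unrelated line.
log = """\
[CODEMETA COMPOSITION (wandexer)] overriding old http://schema.org/codeRepository (https://github.com/knaw-huc/wandexer -> https://github.com/knaw-huc/peen-indexer)
[CODEMETA COMPOSITION (wandexer)] overriding old http://schema.org/version (v1.1.3 -> 1.1.3)
[CODEMETA COMPOSITION (wandexer)] processed 76 new triples, total is now 113
"""

for item in find_overrides(log):
    print(f"{item.prop}: {item.old} -> {item.new}")
```

Listing overrides this way helps spot cases where package metadata disagrees with what the harvester derived from the repository.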

        

Metadata Properties

Version
1.1.3 (release notes)
Interface types
  • Command-line Application
Software website
Source code repository
https://github.com/knaw-huc/wandexer
Category
  • Scientific/Engineering > Information Analysis
Keywords
  • text
Development Status
  • Experimental: The technology is implemented and ready for experimental settings (beta), but requires further work and validation.
  • WIP: Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.
Issue Tracker (Support)
https://github.com/knaw-huc/wandexer/issues
Documentation
License
GPL-3.0-only
Author(s)
Maintainer(s)
Contributor(s)
Producer
Programming Language
  • Python
Runtime Platform
  • Python 3
  • Python 3 Only
  • Python Implementation CPython
Operating System
  • POSIX > Linux
Software dependencies
  • annorepo-client
  • elasticsearch
  • loguru
  • PyYAML
  • requests
  • wheel
Metadata validation
★ ★ ★ ☆ ☆
Created
2025-01-22 15:00:10 +0100
Last modified
2026-03-09 15:18:02 +0100
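
The description and dependency list above indicate that the tool reads W3C Web Annotations from AnnoRepo and writes them to an Elasticsearch index. The following is only an illustrative sketch of that kind of mapping, not wandexer's actual implementation. The function `to_bulk_actions`, the index name `annotations`, and the choice of fields are all assumptions, and fetching annotations through annorepo-client is omitted because its API is not described here.

```python
from typing import Any, Iterable, Iterator


def to_bulk_actions(
    annotations: Iterable[dict[str, Any]], index: str
) -> Iterator[dict[str, Any]]:
    """Turn Web Annotation dicts into bulk-index action dicts (hypothetical helper)."""
    for anno in annotations:
        yield {
            "_index": index,    # target index name (assumed)
            "_id": anno["id"],  # reuse the annotation IRI as the document id
            "_source": {
                "type": anno.get("type"),
                "body": anno.get("body"),
                "target": anno.get("target"),
            },
        }


# A minimal annotation in the shape defined by the W3C Web Annotation Data Model.
sample = [
    {
        "@context": "http://www.w3.org/ns/anno.jsonld",
        "id": "http://example.org/anno1",
        "type": "Annotation",
        "body": "http://example.org/post1",
        "target": "http://example.com/page1",
    }
]

actions = list(to_bulk_actions(sample, "annotations"))
print(actions[0]["_id"])
```

Action dictionaries of this shape can be passed to `elasticsearch.helpers.bulk(client, actions)` from the `elasticsearch` package. Connecting to a cluster is left out so that the sketch runs without one.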