A Blacklab Server CLARIN FCS 2.0 endpoint

CLARIAH Federated content search corpora, developed by the Dutch Language Institute (INT), is a service to enable searching in multiple Dutch corpora at the same time. This application implements the CLARIN FCS 2.0 specification on top of Dutch language corpora. This repository hosts the source code.

Provided tools & services

Dutch FCS endpoints hosted at INT

CLARIAH Federated content search backends - instances for several Dutch corpora
Type
  • Unknown

FCS Aggregator

The Aggregator application is a part of the CLARIN-FCS common federated content search infrastructure. It serves as a user interface to perform queries to CLARIN-resources and display search results. The Aggregator communicates with components called endpoints, which are provided as a service by all centres who participate in the federated content search. Each endpoint provides access to one or more searchable resources. The user can select a specific resource or resources, based on the resource name or on the language, or search through all of them. The content of these resources is searched with the query supplied to the endpoint. The endpoint returns results to this query and the aggregator collects the responses from all the endpoints and displays them to the user.
Type
  • Web Application

Tool suite: Blacklab & Corpus Search

The following closely related tools are in a tool suite together with A Blacklab Server CLARIN FCS 2.0 endpoint:

  • Active: The project has reached a stable, usable state and is being actively developed.
Created: 2012-10-04
Modified: 2022-10-06
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.

INT Corpus Frontend 3.1.1

A web application to search corpora through the BlackLab Server web service. [view more]
  • corpus
Created: 2014-03-19
Modified: 2024-02-02

Citation

You can cite this software using the following citation generated from its metadata:

Logs & Reviews

Name
Automatic software metadata validation report for A Blacklab Server CLARIN FCS 2.0 endpoint 0.1
Author
  • codemetapy validator using software.ttl
Date
2024-07-20 03:03:00
Review
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems

Validation of A Blacklab Server CLARIN FCS 2.0 endpoint 0.1 was successful (score=4/5), but there are some remarks which you may or may not want to address:

1. Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)
2. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
3. Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)
Rating
★ ★ ★ ★ ☆
(log file starts at Sat Jul 20 03:02:49 UTC 2024)

[harvester info] --> Processing clariah-fcs-endpoints (https://github.com/INL/clariah-fcs-endpoints) [Sat Jul 20 03:02:49 UTC 2024]

[harvester info] Git updating cached clone of https://github.com/INL/clariah-fcs-endpoints...

[harvester info] No releases found, falling back to default git branch!

[harvester info] Using 'master'

[harvester info] Git reference: master

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/clariah-fcs-endpoints for harvestable resources...

[harvester info] found codemeta-harvest.json for clariah-fcs-endpoints (md5sum 751b47d38ed83d33e0e94cff74870c84); values in here take precendence over (override) those in later detection stages

[harvester info] found pom.xml (Java/Maven) for clariah-fcs-endpoints, converting to codemeta

[harvester info] Looking for license....

[harvester info] Found license GPL-3.0-only

[harvester info] Getting contributors from git...

[harvester info] No git contributors found

[harvester info] Getting top contributor from git...

[harvester info] Git top contributor  will be assigned as author (and maintainer) if none are found in the metadata

[harvester info] Extracting last and first commit date from git log....

[harvester info] Date created: 2016-09-11T19:38:43Z+0200, date modified: 2023-05-10T15:46:27Z+0200

[harvester info] Querying Github/GitLab API (https://github.com/INL/clariah-fcs-endpoints)

[harvester info] Adding URL for found README: README.md

[harvester info] Looking for TRL information in README.md...

[harvester info] Looking for repostatus information in README.md...

[harvester info] Looking for continuous integration information in README.md...

[harvester info] Looking for documentation links in README.md...

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#inactive

[harvester info] Setting group Blacklab & Corpus Search

[harvester info] Reconciliating: codemetapy  --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "clariah-fcs-endpoints" --codeRepository "https://github.com/INL/clariah-fcs-endpoints" --validate /etc/software.ttl --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/clariah-fcs-endpoints.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.clariah-fcs-endpoints.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.clariah-fcs-endpoints.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.clariah-fcs-endpoints.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.clariah-fcs-endpoints.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.clariah-fcs-endpoints.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.clariah-fcs-endpoints.codemeta.json /tmp/codemeta-harvester.cache//tmp/21-java.clariah-fcs-endpoints.codemeta.json /tmp/codemeta-harvester.cache//tmp/10-harvest.clariah-fcs-endpoints.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.clariah-fcs-endpoints.codemeta.json 

-- begin log --

Passed 9 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-repostatus.clariah-fcs-endpoints.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.clariah-fcs-endpoints.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.clariah-fcs-endpoints.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.clariah-fcs-endpoints.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.clariah-fcs-endpoints.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.clariah-fcs-endpoints.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/21-java.clariah-fcs-endpoints.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/10-harvest.clariah-fcs-endpoints.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.clariah-fcs-endpoints.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://tools.clariah.nl/clariah-fcs-endpoints

Processing source #1 of 9

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.clariah-fcs-endpoints.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/clariah-fcs-endpoints

[CODEMETA COMPOSITION (https://tools.clariah.nl/clariah-fcs-endpoints)] processed 1 new triples, total is now 2

Processing source #2 of 9

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.clariah-fcs-endpoints.codemeta.json

    Found main resource with URI https://tools.clariah.nl/clariah-fcs-endpoints.topcontributor/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/clariah-fcs-endpoints

[CODEMETA COMPOSITION (https://tools.clariah.nl/clariah-fcs-endpoints)] processed 1 new triples, total is now 2

Processing source #3 of 9

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.clariah-fcs-endpoints.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/clariah-fcs-endpoints

[CODEMETA COMPOSITION (https://tools.clariah.nl/clariah-fcs-endpoints)] processed 1 new triples, total is now 3

Processing source #4 of 9

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.clariah-fcs-endpoints.codemeta.json

    Found main resource with URI https://tools.clariah.nl/clariah-fcs-endpoints/snapshot

    Injected (possibly temporary) URI https://tools.clariah.nl/clariah-fcs-endpoints

[CODEMETA COMPOSITION (https://tools.clariah.nl/clariah-fcs-endpoints)] processed 16 new triples, total is now 18

Processing source #5 of 9

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.clariah-fcs-endpoints.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/clariah-fcs-endpoints

[CODEMETA COMPOSITION (https://tools.clariah.nl/clariah-fcs-endpoints)] overriding old http://schema.org/dateCreated (2017-09-03T19:25:10Z -> 2016-09-11T19:38:43Z+0200)

[CODEMETA COMPOSITION (https://tools.clariah.nl/clariah-fcs-endpoints)] overriding old http://schema.org/dateModified (2024-01-22T17:15:05Z -> 2023-05-10T15:46:27Z+0200)

[CODEMETA COMPOSITION (https://tools.clariah.nl/clariah-fcs-endpoints)] processed 2 new triples, total is now 18

Processing source #6 of 9

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.clariah-fcs-endpoints.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/clariah-fcs-endpoints

[CODEMETA COMPOSITION (https://tools.clariah.nl/clariah-fcs-endpoints)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> GPL-3.0-only)

[CODEMETA CORRECTION (https://tools.clariah.nl/clariah-fcs-endpoints)] automatically converting license to spdx URI

[CODEMETA COMPOSITION (https://tools.clariah.nl/clariah-fcs-endpoints)] processed 1 new triples, total is now 18

Processing source #7 of 9

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/21-java.clariah-fcs-endpoints.codemeta.json

    Found main resource with URI https://tools.clariah.nl/org.ivdnt.fcs.endpoint.clariah-fcs-endpoints/0.1

    Injected (possibly temporary) URI https://tools.clariah.nl/clariah-fcs-endpoints

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] overriding old http://schema.org/description (REST endpoints for CLARIAH Federated Content Search -> CLARIAH Federated content search for Dutch corpora, developed by the Dutch Language Institute (INT), is a service to enable searching in multiple Dutch corpora at the same time according to the CLARIN FCS 2.0 specification on top of Dutch language corpora. This repository hosts the source code.)

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> http://www.gnu.org/licenses/gpl.txt)

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] overriding old http://schema.org/name (clariah-fcs-endpoints -> A Blacklab Server CLARIN FCS 2.0 endpoint)

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] processed 80 new triples, total is now 93

Processing source #8 of 9

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/10-harvest.clariah-fcs-endpoints.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/clariah-fcs-endpoints

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] overriding old http://schema.org/description (CLARIAH Federated content search for Dutch corpora, developed by the Dutch Language Institute (INT), is a service to enable searching in multiple Dutch corpora at the same time according to the CLARIN FCS 2.0 specification on top of Dutch language corpora. This repository hosts the source code. -> CLARIAH Federated content search corpora, developed by the Dutch Language Institute (INT), is a service to enable searching in multiple Dutch corpora at the same time. This application implements the CLARIN FCS 2.0 specification on top of Dutch language corpora. This repository hosts the source code.)

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] overriding old http://schema.org/license (http://www.gnu.org/licenses/gpl.txt -> https://www.gnu.org/licenses/gpl-3.0.html)

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] overriding old http://schema.org/producer (https://tools.clariah.nl/org/dutch-language-institute -> https://tools.clariah.nl/stub/H12a879353819ffa0)

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] overriding old http://schema.org/keywords (fcs -> corpus search)

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] overriding old http://schema.org/keywords (clariah -> corpus search)

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] overriding old http://schema.org/keywords (corpus -> corpus search)

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] processed 41 new triples, total is now 127

Processing source #9 of 9

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.clariah-fcs-endpoints.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.clariah.nl/clariah-fcs-endpoints

[CODEMETA COMPOSITION (org.ivdnt.fcs.endpoint.clariah-fcs-endpoints)] processed 1 new triples, total is now 128

Remapping URI to (possibly) new identifier and version component: https://tools.clariah.nl/clariah-fcs-endpoints -> https://tools.clariah.nl/clariah-fcs-endpoints/0.1

[CODEMETA VALIDATION (clariah-fcs-endpoints)] done

[CODEMETA ENRICHMENT (clariah-fcs-endpoints)] adding author https://tools.clariah.nl/stub/H6a6722779f55f3a9 as contributor

[CODEMETA ENRICHMENT (clariah-fcs-endpoints)] adding author https://tools.clariah.nl/stub/H-6e0485838a055eb7 as contributor

[CODEMETA ENRICHMENT (clariah-fcs-endpoints)] adding author https://tools.clariah.nl/stub/H0039b701642830ef as contributor

VALIDATION https://tools.clariah.nl/clariah-fcs-endpoints/0.1 #1: Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)

VALIDATION https://tools.clariah.nl/clariah-fcs-endpoints/0.1 #2: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)

VALIDATION https://tools.clariah.nl/clariah-fcs-endpoints/0.1 #3: Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)

-- end log --

[harvester info] Output written to /tmp/out/clariah-fcs-endpoints.codemeta.json

[harvester info] Harvesting remote service URL https://portal.clarin.inl.nl/fcscorpora/clariah-fcs-endpoints/sru for clariah-fcs-endpoints: codemetapy  --baseuri https://tools.clariah.nl --baseuri https://tools.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl -O "/tmp/codemeta-harvester.cache//tmp/clariah-fcs-endpoints.codemeta.json" "/tmp/out/clariah-fcs-endpoints.codemeta.json" "https://portal.clarin.inl.nl/fcscorpora/clariah-fcs-endpoints/sru"

[harvester info] <-- Finished processing clariah-fcs-endpoints (https://github.com/INL/clariah-fcs-endpoints) [Sat Jul 20 03:03:04 UTC 2024]

        

Metadata Properties

Version
0.1
Interface types
  • Web Application
Source code repository
 https://github.com/INL/clariah-fcs-endpoints  Stars are an indicator of the popularity of this project on GitHub
Keywords
  • BlackLab
  • CLARIN
  • corpus search
  • FCS 2.0
  • Federated Content Search
  • Nederlab
Development Status
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.
Issue Tracker (Support)
https://github.com/INL/clariah-fcs-endpoints/issues  The number of open issues on the issue tracker  The number of closes issues on the issue tracker
Documentation
License
Author(s)
Maintainer(s)
Contributor(s)
Producer
Programming Language
  • Java
Continuous Integration Tests
https://github.com/INL/clariah-fcs-endpoints
Runtime Platform
  • Java
Software dependencies
  • fcs-simple-endpoint
  • jackson-databind
  • json-path
  • json-simple
  • junit
  • log4j-core
  • log4j-slf4j-impl
  • maven-plugin-testing-harness
  • servlet-api
  • test-jetty-servlet
Metadata validation
★ ★ ★ ★ ☆
Created
2016-09-11 19:38:43 +0200
Last modified
2023-05-10 15:46:27 +0200  Last commit (main branch). Gives an indication of project development activity and rough indication of how up-to-date the latest release is.  Number of commits since the last release. Gives an indication of project development activity and rough indication of how up-to-date the latest release is.