Skip to content
@veldhub

VELD HUB

VELD: Versioned Executable Logic and Data

VELD

This github organization collects repositories that implement the VELD Design and adhere to its Metadata Schema.

metadata registry

Below is a list of VELD repositories and relevant metadata. Until a full platform is developed, this README serves as the pragmatic aggregation point.

metadata sections

data velds

code velds

chain velds

topic vocab

content vocab

file_type vocab

data velds

code velds

chain velds

topic vocab

  • Analysis
  • Bible Studies
  • Data Cleaning
  • Data Visualization
  • database
  • demo
  • Dependency Parsing
  • ETL
  • Evaluation
  • Grammatical Annotation
  • Lemmatization
  • Machine Learning
  • Named Entity Recognition
  • NLP
  • Part Of Speech
  • Preprocessing
  • RDF
  • Testing
  • Tokenization
  • triplestore
  • Universal Dependencies
  • Word Embeddings

content vocab

  • annotated literature
  • data visualization
  • download urls and target file names
  • enriched text
  • evaluation data
  • GloVe model
  • gold data
  • grammatically annotated text
  • inferenced NLP data
  • lemmatized text
  • lemmatizer
  • linguistic data
  • linguistically enriched text
  • log
  • metadata
  • ML gold data
  • model metadata
  • natural text
  • NER data
  • NER gold data
  • NER model
  • newspaper texts
  • NLP data
  • NLP gold data
  • NLP model
  • NLP statistics
  • NLP training data
  • Part Of Speech of text
  • raw text
  • RDF/XML
  • spaCy model
  • spacy training config
  • sparql query
  • statistics
  • TEI
  • tokenized text
  • tokenizer
  • Universal Dependencies of text
  • Word Embeddings
  • Word Embeddings model
  • Word Embeddings training data
  • Word Embeddings vectors

file_type vocab

  • bin
  • cfg
  • conllu
  • csv
  • fastText model
  • GloVe model
  • html
  • ini
  • json
  • pkl
  • png
  • rq
  • spaCy docbin
  • spaCy model
  • tsv
  • txt
  • udpipe model
  • word2vec model
  • xml
  • xslt
  • yaml

Popular repositories Loading

  1. veld_chain__eltec_udpipe_inference veld_chain__eltec_udpipe_inference Public

    chain velds using udpipe 1 to infer on five ELTeC corpora.

    XSLT

  2. veld_chain__mara_load_and_publish_models veld_chain__mara_load_and_publish_models Public

    Chain velds for publishing self-trained MARA models to huggingface.

    Python

  3. veld_chain__train_infer_wordembeddings_multiple_architectures__amc veld_chain__train_infer_wordembeddings_multiple_architectures__amc Public

    Chain velds encapsulating training and evaluating static word embedding architectures on the Austria Media Corpus.

    1

  4. veld_chain__apis_ner_evaluate_old_models veld_chain__apis_ner_evaluate_old_models Public

    Chain velds encapsulating evalution of old spacy models.

    Python

  5. veld_chain__apis_ner_transform_to_gold veld_chain__apis_ner_transform_to_gold Public

    Chain velds encapsulating extraction and conversion of gold data.

    Python

  6. veld_chain__train_spacy_apis_ner veld_chain__train_spacy_apis_ner Public

    Chain velds encapsulating a spacy NER training setup on APIS data.

    Jupyter Notebook

Repositories

Showing 10 of 60 repositories
  • veld_code__wordembeddings_preprocessing Public

    Code velds encapsulating preprocessing for training of wordembeddings.

    veldhub/veld_code__wordembeddings_preprocessing’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Mar 10, 2025
  • veldhub/veld_chain__dta_semantic_drift_analysis’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Mar 10, 2025
  • veld_chain__eltec_udpipe_inference Public

    chain velds using udpipe 1 to infer on five ELTeC corpora.

    veldhub/veld_chain__eltec_udpipe_inference’s past year of commit activity
    XSLT 0 MIT 0 0 0 Updated Mar 3, 2025
  • veld_code__xml_xslt_transformer Public

    Code veld encapsulating generic xml / xslt transformation.

    veldhub/veld_code__xml_xslt_transformer’s past year of commit activity
    Shell 0 MIT 0 0 0 Updated Mar 3, 2025
  • veldhub/veld_code__wordembeddings_postgres_db’s past year of commit activity
    Dockerfile 0 MIT 0 0 0 Updated Mar 3, 2025
  • .github Public

    info repo for veldhub org

    veldhub/.github’s past year of commit activity
    0 MIT 0 0 0 Updated Mar 3, 2025
  • veld_chain__demo_wordembeddings_multiarch Public

    A VELD demonstration, aggregating heterogeneous modular workflows into a cohesive reproducible pipeline.

    veldhub/veld_chain__demo_wordembeddings_multiarch’s past year of commit activity
    Jupyter Notebook 0 MIT 0 0 0 Updated Mar 3, 2025
  • veld_code__wordembeddings_evaluation Public

    Code velds encapsulating evaluation of wordembeddings trained by various architectures.

    veldhub/veld_code__wordembeddings_evaluation’s past year of commit activity
    C 0 MIT 0 0 0 Updated Mar 3, 2025
  • veldhub/veld_chain__compare_tokenizations’s past year of commit activity
    Jupyter Notebook 0 MIT 0 0 0 Updated Mar 3, 2025
  • veld_chain__automatic_tei-ification_of_gutenberg Public

    Chain velds encapsulating automatic tei conversion on gutenberg data:

    veldhub/veld_chain__automatic_tei-ification_of_gutenberg’s past year of commit activity
    0 MIT 0 0 0 Updated Mar 2, 2025

Top languages

Loading…

Most used topics

Loading…