lm_viz

Visualizing Language Models

Language models (e.g. character embeddings) are essential for success in NLP tasks. Especially in Part-of-Speech tagging and Named Entity Recognition, training yields more precise models when it builds on an adequate, already existing language model. Since the advent of word2vec and large transformer-based language models (such as BERT or GPT-3), a variety of specialized and fine-tuned language models has become available. Despite their widespread use and their necessity for specific model training (e.g. for language entities with only sparse data), our understanding of the models themselves is limited at best. To strengthen that understanding and to start the process of reflecting on language models, this challenge asks for creative ways of visualizing them. We envision 3D visualizations based on dimension reduction that show where e.g. synonyms and homonyms are positioned in vector space, or listings of semantic fields (neighboring vectors). For context-insensitive approaches (e.g. word2vec or GloVe), we imagine using the fixed vectors and representing calculations in grids.
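The two ideas above (reducing fixed word vectors to three dimensions, and listing the neighbors of a word) can be sketched in a few lines. This is only an illustration under stated assumptions and not the pipeline of this repository: the vocabulary and the random vectors are made-up placeholders for real word2vec or GloVe embeddings, the helper names `pca_project` and `nearest_neighbors` are hypothetical, and PCA is just one possible choice of dimension reduction.

```python
import numpy as np


def pca_project(vectors, n_components=3):
    """Project row vectors onto their first principal components."""
    centered = vectors - vectors.mean(axis=0)
    # Rows of vt are the principal directions of the centered data.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


def nearest_neighbors(word, vocab, vectors, k=3):
    """Return the k words with the highest cosine similarity to `word`."""
    unit = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    sims = unit @ unit[vocab.index(word)]
    order = np.argsort(-sims)  # most similar first
    return [vocab[i] for i in order if vocab[i] != word][:k]


# Placeholder data: random vectors stand in for real embeddings (assumption).
rng = np.random.default_rng(0)
vocab = ["king", "queen", "bank", "river", "money"]
vectors = rng.normal(size=(len(vocab), 50))

coords = pca_project(vectors)  # one (x, y, z) point per word
neighbors = nearest_neighbors("bank", vocab, vectors, k=2)
```

The rows of `coords` could then be handed to any 3D scatter plot, with the entries of `vocab` as point labels. With real embeddings, `neighbors` would approximate a semantic field; with the random placeholder vectors used here it carries no meaning.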

Goals of the Repository

See our poster for Bern Data Science Day.

Description of Pipeline

ToDo

Environment

ToDo

