Interactive demonstration of word embeddings and context-based learning
Human Embeddings
Before we dive into complex models, let's try something simpler. Look at the plot on the right and tell me: where are "chicken" and "king" in this coordinate system?
What are their X₁ and X₂ coordinates?
Word Embedding Space
Congratulations, you just did text embeddings!
You just converted words into numbers! chicken = (3.5, 2.0) and king = (1.0, 5.0).
These coordinate pairs are called "embeddings" - they capture word meaning as numbers that computers can work with. Notice how similar words are close together in this space.
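Here's a tiny sketch of that idea in code (Python with numpy; the "hen" coordinates are assumed for illustration). Closeness in the plot becomes a number we can compute:

```python
import numpy as np

# Coordinates from the plot: (X1, X2). The "hen" position is assumed.
embeddings = {
    "chicken": np.array([3.5, 2.0]),
    "hen":     np.array([4.0, 1.5]),
    "king":    np.array([1.0, 5.0]),
}

def distance(a, b):
    """Euclidean distance between two word embeddings."""
    return float(np.linalg.norm(embeddings[a] - embeddings[b]))

print(distance("chicken", "hen"))   # small: similar words sit close together
print(distance("chicken", "king"))  # large: different meanings sit far apart
```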
Understanding Meaning Dimensions
Each coordinate axis represents a meaning dimension: a continuous feature that tells us something about the word's meaning. Representing a word with a list of such features gives us an embedding vector.
X₂ axis (power/status dimension): higher values mean more power (royalty: king, queen, monarch); lower values mean less power (animals: rooster, chicken, hen).
X₁ axis (gender dimension): values toward one end are more male (man, king, rooster); toward the other, more female (woman, queen, hen).
Why Start With Just 2 Dimensions?
Real word embeddings like those in GPT or Word2Vec use 300+ dimensions - imagine hundreds of meaning aspects like formality, emotion, concreteness, etc. We start with just 2 dimensions (power and gender) to build your intuition.
Same principles, different scale! Whether it's 2 dimensions or 300, the core concept is identical: words become vectors of numbers that capture semantic meaning.
Key insight:
Each dimension captures a different aspect of meaning. Instead of just saying "king is royal", we can say "king has a power value of 5.0 and a gender value of 1.0". This turns fuzzy concepts into precise numbers that computers can calculate with!
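Because meanings are now numbers, we can literally calculate with them. Here's a sketch using the two axes above (all coordinates except king's are assumed for illustration):

```python
import numpy as np

# (gender, power) coordinates; only king's come from the plot above
king  = np.array([1.0, 5.0])
man   = np.array([1.0, 1.0])
woman = np.array([5.0, 1.0])
queen = np.array([5.0, 5.0])

# Remove "male", add "female": king - man + woman lands on queen
result = king - man + woman
print(result)                       # [5. 5.]
print(np.allclose(result, queen))   # True
```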
What are we learning?
Now that you've seen how words can be represented as coordinates, let's explore how computers learn these representations automatically! Our teaching example is loosely based on CBOW (Continuous Bag of Words), which learns word meanings by predicting a target word from its surrounding context.
Learning Objective: CBOW Model
Given context words (like "the ___ sat on the"), the computer predicts the missing word ("cat"). By training on this task, words that appear in similar contexts develop similar numerical representations, capturing semantic relationships.
Why CBOW works:
Words that appear in similar contexts tend to have similar meanings. "cat" and "dog" both appear after "the" and before "sat", so they develop similar embeddings. This distributional hypothesis forms the foundation of modern word embeddings.
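Here's a minimal sketch of the CBOW prediction step (numpy only; the vocabulary, dimension, and initialization are toy assumptions, not the demo's actual internals): average the context embeddings, score every vocabulary word, and apply a softmax.

```python
import numpy as np

rng = np.random.default_rng(0)
vocab = ["the", "cat", "dog", "sat", "on", "mat"]
word_to_id = {w: i for i, w in enumerate(vocab)}
dim = 5

# As in word2vec: one matrix for context words, one for target scores
E_in = rng.normal(0, 0.1, (len(vocab), dim))
E_out = rng.normal(0, 0.1, (len(vocab), dim))

def predict(context_words):
    """CBOW forward pass: average context embeddings, softmax over vocab."""
    h = E_in[[word_to_id[w] for w in context_words]].mean(axis=0)
    scores = E_out @ h
    exp = np.exp(scores - scores.max())
    return exp / exp.sum()

probs = predict(["the", "sat", "on", "the"])  # context of "the ___ sat on the"
print(vocab[int(probs.argmax())])             # untrained, so essentially random
```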
Our Training Dataset
Our CBOW model was pre-trained on thousands of sentences from news articles and books. The full dataset contains rich contexts for animals (cat/dog), rulers (king/queen), cities (Paris/London), and various actions. Below is a sample of the vocabulary learned from this corpus.
Why word IDs matter:
Computers can't work with words directly - they need numbers. Each word gets a unique ID (like "cat" → 2, "sat" → 3). These IDs are used to look up the word's embedding vector during training and prediction.
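In code, this lookup is just a dictionary plus a matrix row index (a sketch; the IDs for "cat" and "sat" match the example above, the rest are assumed):

```python
import numpy as np

word_to_id = {"the": 0, "mat": 1, "cat": 2, "sat": 3, "on": 4}

# One row of 5 numbers per word (random before training)
embeddings = np.random.default_rng(0).normal(0, 0.1, (len(word_to_id), 5))

# "cat" -> ID 2 -> row 2 of the embedding matrix
cat_vector = embeddings[word_to_id["cat"]]
print(word_to_id["cat"], cat_vector.shape)  # 2 (5,)
```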
Select Training Sentence
Choose which sentence the computer should learn from.
Context Window
Look at 5 words on each side of our target word.
Target Word Position
Which word are we trying to predict?
Embedding Dimensions
Each word gets 5 numbers to describe it.
1. Breaking Down the Sentence
First, we identify our context words (the clues) and our target word (what we're trying to predict). Context words are colored blue; the target word is purple.
Process explanation:
The model uses context words (blue) to predict the target word (purple) through learned associations.
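Splitting a sentence this way takes only a few lines (a sketch; the sentence and window size mirror the controls above):

```python
def make_example(tokens, target_pos, window=5):
    """Return (context_words, target_word) for one CBOW training example."""
    lo = max(0, target_pos - window)
    hi = target_pos + 1 + window
    context = tokens[lo:target_pos] + tokens[target_pos + 1:hi]
    return context, tokens[target_pos]

tokens = "the cat sat on the mat".split()
print(make_example(tokens, target_pos=1))
# (['the', 'sat', 'on', 'the', 'mat'], 'cat')
```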
2. Training the Model
Each training step improves the embeddings by adjusting them based on prediction errors. Words that appear in similar contexts will gradually develop similar vector representations.
What to observe during training:
Loss decreasing: The model is getting better at predictions
Similarity changes: Words in similar contexts become more similar (see the cosine-similarity sketch after this list)
Embedding evolution: Watch the numbers in the embedding matrix change
Prediction improvements: The model should predict the target word with higher confidence
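The similarity being tracked is usually cosine similarity between embedding rows. Here is a sketch (the vectors are made-up stand-ins for post-training values):

```python
import numpy as np

def cosine_similarity(a, b):
    """+1 = same direction, 0 = unrelated, -1 = opposite."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Made-up embeddings for two words that share contexts
cat = np.array([0.9, 0.1, 0.8, 0.2, 0.1])
dog = np.array([0.8, 0.2, 0.9, 0.1, 0.2])
print(round(cosine_similarity(cat, dog), 3))  # near 1.0 after training
```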
Training Controls
Watch the model learn meaningful word relationships! The Learning Rate control sets how fast the model learns (default 0.5), and the Training Progress panel tracks the current step, the loss, and the training status, including whether semantic clustering has emerged.
How training works:
The model starts with random embeddings. When it makes wrong predictions, it adjusts the embeddings slightly. Over many steps, words that appear in similar contexts develop similar embeddings - this is how meaning emerges from statistics!
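Here's a sketch of one such adjustment (full-softmax gradient descent for clarity; real word2vec speeds this up with tricks like negative sampling, and everything here is a toy assumption):

```python
import numpy as np

rng = np.random.default_rng(0)
vocab = ["the", "cat", "sat", "on", "mat"]
word_to_id = {w: i for i, w in enumerate(vocab)}
dim, lr = 5, 0.5

E_in = rng.normal(0, 0.1, (len(vocab), dim))   # random starting embeddings
E_out = rng.normal(0, 0.1, (len(vocab), dim))

def train_step(context, target):
    """Predict the target from its context, then nudge the embeddings."""
    global E_out
    ctx = [word_to_id[w] for w in context]
    t = word_to_id[target]
    h = E_in[ctx].mean(axis=0)                 # forward: average the context
    s = E_out @ h
    exp = np.exp(s - s.max())
    probs = exp / exp.sum()
    loss = -np.log(probs[t])                   # wrong prediction -> high loss
    err = probs.copy()
    err[t] -= 1.0                              # softmax + cross-entropy gradient
    grad_h = E_out.T @ err
    E_out = E_out - lr * np.outer(err, h)      # adjust output embeddings
    for i in ctx:                              # adjust each context embedding
        E_in[i] -= lr * grad_h / len(ctx)
    return loss

for _ in range(50):
    loss = train_step(["the", "sat", "on", "the"], "cat")
print(round(float(loss), 4))  # loss shrinks as predictions sharpen
```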
Training Results: Watch Embeddings Learn!
Live Training View: Watch as random numbers transform into meaningful word embeddings! The matrix shows the actual numbers, while the visualization shows how words cluster by meaning.
Word Embedding Matrix
Each row is a word, each column is a "trait" dimension. Watch these numbers evolve during training!
2D Semantic Visualization
Words projected into 2D space. Similar words cluster together as the model learns!
Note: Your embeddings have more than two dimensions (five in this demo), but we're showing them in 2D using a PCA-like projection to visualize relationships.
What you're seeing:
Left (Matrix): The raw numbers that represent word meanings. Similar words develop similar number patterns.
Right (Visualization): The same data projected to 2D space. Watch words move from random positions into semantic clusters during training!
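The "PCA-like projection" mentioned above can be an actual PCA. Here's a sketch with scikit-learn (the embedding matrix is a random stand-in for the learned one):

```python
import numpy as np
from sklearn.decomposition import PCA

words = ["the", "cat", "dog", "sat", "king", "queen"]
embeddings = np.random.default_rng(0).normal(size=(len(words), 5))  # stand-in

coords = PCA(n_components=2).fit_transform(embeddings)  # project 5D -> 2D
for word, (x, y) in zip(words, coords):
    print(f"{word:>6}: ({x:+.2f}, {y:+.2f})")
```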
Interpretability
Now let's see what real word embeddings actually look like! In practice, embeddings have hundreds of dimensions (300-1024 is common). While most dimensions are hard to interpret, some clearly capture human-understandable concepts like gender, age, or social status.
Key Insight: Emergent Interpretability
Even though computers learn embeddings without being told what concepts to capture, some dimensions naturally emerge that correspond to meaningful semantic properties. This shows that mathematical word representations can capture real-world knowledge!
Interactive Real Embeddings Matrix
Click on dimension buttons to see how different concepts light up across words. Colors represent embedding values: red = high positive, blue = high negative, white = near zero.
Understanding the Matrix:
Rows = Different words | Columns = Individual embedding dimensions (300+ total) | Colors = Strength and direction of each feature
Notice how certain dimensions consistently activate for semantically similar words - this is how computers learn to group related concepts!
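Probing a dimension like this boils down to comparing one column of the matrix across words. A sketch with made-up values (real embeddings would be loaded from pre-trained files such as word2vec or GloVe):

```python
import numpy as np

# Made-up 6-dimensional embeddings for illustration
emb = {
    "king":  np.array([+0.9, 0.1, -0.7, 0.3, +0.2, 0.0]),
    "queen": np.array([-0.8, 0.2, -0.6, 0.3, +0.1, 0.1]),
    "man":   np.array([+0.7, 0.0, +0.5, 0.1, -0.2, 0.3]),
    "woman": np.array([-0.9, 0.1, +0.6, 0.2, -0.1, 0.2]),
}

# Dimension 0 flips sign between male and female words: a "gender" feature
for word, vec in emb.items():
    print(f"{word:>6}: dim 0 = {vec[0]:+.1f}")
```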
From Toy Examples to Real AI
Large language models like GPT build on the same principle: they learn thousands of features, some interpretable and many not, that capture aspects of human language and knowledge. Your 2D intuition scales to these high-dimensional spaces!
Summary
This demonstration illustrated how computers learn word relationships through predictive modeling:
Training process: Random embeddings gradually improve through prediction tasks
Semantic clustering: Words appearing in similar contexts develop similar numerical representations
Contextual learning: The training objective forces related words to cluster together
Vector semantics: Word meanings become mathematical objects that can be manipulated computationally
By training on the simple context prediction task, the model discovers meaningful semantic relationships. This approach forms the foundation of modern natural language processing and large language models.