Skip to content

timurci/llm-engineering

Repository files navigation

LLM Engineering Projects

This repository consists of a collection of my implementations to an llm engineering project list composed by Ahmad M. Osman

GitHub License

Roadmap

The following is an incomplete version of the composed list.

  1. Tokenization & embeddings (README)
    • Build a byte-pair encoder to train your own subword vocabulary
    • Implement a token visualizer to map chunks to IDs
    • One-hot encoding vs learned embeddings, plot cosine distances
  2. Positional embeddings (in progress)
    • Implement four demos: classic sinusoidal vs learned vs RoPE vs ALiBi
    • Animate a toy sequence being position-encoded in 3D
    • Ablate positions to see the attention collapse
  3. Self attention & multi-head attention
    • Hand-wire dot-product attention for one token
    • Scale to multi-head, plot per-head weight heatmaps
    • Mask out future tokens, verify causal property

About

A collection of LLM engineering projects

Resources

License

Stars

Watchers

Forks

Contributors

Languages