🏛️
Learning
Pinned
- SemiAnalysisAI/InferenceX (Public): Open Source Continuous Inference Benchmark Research Platform: Kimi K2.7-Code, MiniMax M3, DeepSeekv4, GLM5 - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 & soon™ TPUv6e/v7/Trainium2/3
- DeepSeek-v3-From-Scratch (Public, Python): A from-scratch implementation of the DeepSeek v3 model, including Multi-Head Latent Attention and the MoE architecture
- DeepMatrixCapsules/DeepMatrixCapsules (Public): Deep Matrix Capsules implementation
- Unsloth-Puzzles (Public, Python): A Triton kernel for NF4 dequantization that is faster than HF, and a gradient checkpointing implementation



