Pinned Loading
-
huggingface/transformers
huggingface/transformers Public🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
-
vllm-project/vllm
vllm-project/vllm PublicA high-throughput and memory-efficient inference and serving engine for LLMs
-
vllm-project/llm-compressor
vllm-project/llm-compressor PublicTransformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
-
vllm-project/compressed-tensors
vllm-project/compressed-tensors PublicA safetensors extension to efficiently store sparse quantized tensors on disk
-
canada-quant/dsv4-pro-nvfp4-fp8-mtp
canada-quant/dsv4-pro-nvfp4-fp8-mtp PublicNVFP4-FP8 quantization of DeepSeek-V4-Pro with MTP retention (work in progress)
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

