Changelog¶

All notable changes to TurboQuantCPU will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[0.1.0] - 2026-03-25¶

This is a major release marking TurboQuantCPU as production-ready with comprehensive benchmarking, documentation, and marketing materials.

3-Model Benchmark Suite: Comprehensive benchmarks on Qwen3.5-0.8B, Llama-3.2-1B, Gemma-2-2b-it
4 Publication-Quality Plots: Compression vs Quality, Speed Overhead, Needle-in-Haystack, Competitor Comparison
Competitor Analysis: Honest comparison with llama.cpp, KIVI, KVQuant, H2O, SnapKV, PyramidKV
Needle-in-Haystack Test: Long-context retrieval benchmark at multiple depths
Benchmark Validation: Automated script validation ensuring all benchmarks run correctly

Comprehensive README: Clear value proposition, benchmark results with interpretations, all 4 plots embedded
Detailed Benchmarks Guide: What each metric means, how to interpret results, visualization explanations
Updated API Reference: Complete API documentation with examples
Marketing-Focused Content: Feature-benefit explanations, competitive positioning, use case guidance

Cleaned GGUF References: Removed incomplete GGUF support, focusing on HuggingFace Transformers
Enhanced Examples: 5 complete examples (getting started, quantization modes, long context, batch processing, API reference)
CI/CD Pipeline: GitHub Actions for automated testing
All 19 Tests Passing: Comprehensive correctness and integration tests

For detailed commit history, see GitHub commits.