working-group-ideas
Maxwell Equations Simulator
Multi-GPU
Add Support for <new type> KV Cache Quantization in TorchAO
QuantizationSparsity
Optimize Quantization Settings to Fit a Given VRAM Budget
QuantizationSparsity
Add an activation sparsity kernel to TorchAO
QuantizationSparsity
Develop Fused Quantized GEMM/GEMV with LoRA
QuantizationSparsity
Implement an LUT-based n-bit Quantization (nf format) Fused Matmul Kernel
QuantizationSparsity
Develop an A16W3 (mixed fp16 x 3-bit) Fused Matmul Kernel
QuantizationSparsity