[Benchmark] qingming-engine Vector Search Performance: RX 7900 XTX 24G Shows Excellent Results | AMD Developer Community | Page 1

patent rune Jan 21, 2026, 1:48 PM

#

SIFT-1M Results: QINGMING-ENGINE on AMD RX 7900 XTX achieves a throughput of 6,275.72 QPS with a P99 latency of 11.214 ms and a recall rate of 99.26%@1/100%@10 for searching 1 million 128-dimensional vectors.
GIST-10M Subset Results: For a 1-million subset of 960-dimensional vectors, the system maintains 470.29 QPS with a P99 latency of 25.755 ms and a high recall of 99.40%@1/100%@10.

The author has open-sourced it. Anyone want to give it a try?
https://github.com/uulong950/qingming-flat/blob/main/README.md

GitHub

qingming-flat/README.md at main · uulong950/qingming-flat

Qingming(青冥)-Flat 是一个全平台高性能暴力向量搜索引擎. Contribute to uulong950/qingming-flat development by creating an account on GitHub.

signal raven Jan 23, 2026, 7:00 AM

#

Hey, nice work! 🔥 6K+ QPS on SIFT-1M with that recall rate is solid.

Cool to see more vector search stuff running on RDNA3 👀

raw mulch Jan 23, 2026, 7:35 AM

#

sounds very impressive

patent rune Jan 23, 2026, 12:51 PM

#

signal raven Hey, nice work! 🔥 6K+ QPS on SIFT-1M with that recall rate is solid. Cool to s...

The previous one was a brute-force implementation. I also have an ANN version, which delivers explosive performance with a QPS of 480k.recall@1 97%,p99=2ms on this device😆

#

rn_image_picker_lib_temp_73aa593e-184a-4c76-a9e4-00dc71edcf16.jpg

#

the gist 960

rn_image_picker_lib_temp_368b7214-5456-4950-80f5-03666b8d76d1.jpg

#[Benchmark] qingming-engine Vector Search Performance: RX 7900 XTX 24G Shows Excellent Results