#powerinfer
1 messages · Page 1 of 1 (latest)
PowerInfer looks promising, but they are not really being up front about everything. A quick glance would indicate;
- they have a fork of llama.cpp
- there are caveats eg "it is important to note that Powerinfer is great for certain hardware, but right not lacking for others."
- PI needs special, 'ReLU' models
- their 11x maybe be more marketing than truth (IMO)
llama.cpp is already considering merging some of the PI magic back into main llama.cpp - https://github.com/ggerganov/llama.cpp/pull/4543
GitHub
Writing a review of PowerInfer with a view to merge into llama.cpp.
References:
https://github.com/SJTU-IPADS/PowerInfer
https://ipads.se.sjtu.edu.cn/_media/publications/powerinfer-20231219.pdf
O...