Hacker News<p>&gt;8 tokens/s DeepSeek R1 671B Q4_K_M with 1-2 Arc A770 GPUs on Xeon: <a href="https://github.com/intel/ipex-llm/blob/main/docs/mddocs/Quickstart/llamacpp_portable_zip_gpu_quickstart.md" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/intel/ipex-llm/blob</span><span class="invisible">/main/docs/mddocs/Quickstart/llamacpp_portable_zip_gpu_quickstart.md</span></a><br><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/DeepSeek" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DeepSeek</span></a> <a href="https://mastodon.social/tags/ArcA770" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArcA770</span></a> <a href="https://mastodon.social/tags/Xeon" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Xeon</span></a> <a href="https://mastodon.social/tags/Tokenization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Tokenization</span></a> <a href="https://mastodon.social/tags/LLM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLM</span></a> <a href="https://mastodon.social/tags/GitHub" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GitHub</span></a></p>