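Reference article: https://community.frame.work/t/dgx-spark-vs-strix-halo-initial-impressions/77055
Based on it, I put together a comparison table (some of the numbers come from my own tests, others I found online, so judge their accuracy for yourself):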
| Model | AMD | GB10 |
| --- | --- | --- |
| LLaMA 3.3 8B | 25 | |
| LLaMA 3.3 70B-Instruct | 2.8 | |
| qwen2 14B Q5K_M | 20 | |
| qwen3.5 9B | 26 | |
| Qwen3.5-27B.Q4_K_M | 11 | 7.2 |
| Qwen3.5-24B-A3B-Claude | 48 | |
| Qwen3.5-35B-A3B | 50 | 46 |
| Llama-3.1-tulu-3-8B-Q8 | 25.52 | |
| gpt-oss-120b-Q4_K_S | 45 | 51.5 |
| llama_server_Qwen3_VL_8B_Instruct_UD_Q4_K_XL | 37.06 | 35.48 |
| qwen3moe 30B.A3B Q8_0 | 40 | |
| qwen2 7B Q8_0 | 27 | |
| glm4moe 106B.A12B Q4_K | 18 | |
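
Only four rows in the table have a number in both columns. To make those easier to compare, here is a small sketch that computes the AMD-to-GB10 ratio for each of them. The dictionary `rows` and the helper `amd_to_gb10_ratio` are names made up for this illustration, and the values are copied from the table as-is (the table does not state a unit, so the ratio is unitless and only meaningful if both columns were measured the same way).

```python
# Rows from the table above where both the AMD and GB10 cells are filled in.
# Values are copied verbatim; the table does not state a unit.
rows = {
    "Qwen3.5-27B.Q4_K_M": (11, 7.2),
    "Qwen3.5-35B-A3B": (50, 46),
    "gpt-oss-120b-Q4_K_S": (45, 51.5),
    "llama_server_Qwen3_VL_8B_Instruct_UD_Q4_K_XL": (37.06, 35.48),
}


def amd_to_gb10_ratio(amd, gb10):
    """Return AMD / GB10; a result above 1.0 means the AMD figure is higher."""
    return amd / gb10


# Hypothetical helper output: one ratio per model name.
ratios = {name: amd_to_gb10_ratio(amd, gb10) for name, (amd, gb10) in rows.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}")
```

Since the numbers come from mixed sources, small differences between the two columns should probably not be read as significant.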