Qwen3.5-35B – 16GB GPU – 100T/s with 120K context AND vision enabled

by willfinger | View on Hacker News