Ditching the cloud for local AI — how I use two mini PCs to process millions of tokens a day and save money on costly API fees
For this kind of reading, thinking, analyzing, and re-presenting, local models work brilliantly. They have high throughput but are working in the background, meaning that the slower time to first token that many local LL…









