PewDiePie goes all-in on self-hosting AI using modded GPUs, with plans to build his own model soon — YouTuber pits multiple chatbots against each other to find

PewDiePie goes all-in on self-hosting AI using modded GPUs, with plans to build his own model soon — YouTuber pits multiple chatbots against each other to find

Pewds went on a tangent about how our data isn't really ours and that he's often spooked by AI knowing things about him in chat that he previously talked about. Despite deleting the chats, the data remains and is still used to train models unless you actively remove it from a company’s servers. This is where connecting your local data to the AI becomes a game-changer. Through RAG, Felix demonstrated that the model can retrieve information locally from his computer, so much so that it even knew things like his address or phone number.

PewDiePie just vibe-coded his own Chat UI, built an army of chatbots for majority voting and gave them all RAG, DeepResearch and audio outputnaturally, he only uses chinese Qwen models and runs them on his local PC with 8x modded chinese 48GB 4090s and 2x RTX 4000 Adahis army… pic.twitter.com/vS6DlPFwdQ October 31, 2025

This is where the general experimentation stopped, and the last few minutes of the video devolved into what our future sentient AI overlords might call “morally questionable.” Felix built an army of chatbots that all convene to provide answers to a single prompt. Those responses would then be voted on in a democratic process, with the weakest chatbots being eliminated from the “council.”

Eventually, the council learned that its members would be removed if they failed, and the AI became so smart that it colluded against Pewds, strategizing to game the system and avoid being erased. The solution was simple: switch to a smaller model with fewer parameters, and the bots once again fell victim to the circus.

From this came the idea of “The Swarm” — a collection of dozens of AIs running at once using 2B-parameter models. Pewds said he didn't realize he could run more than one AI on one GPU, which led to the creation of 64 of them across his entire stack. It was so over-the-top that the web UI eventually crashed. On the flip side, this gave Felix the idea of creating his own model.

(Image credit: PewDiePie on YouTube) (Image credit: PewDiePie on YouTube) The Swarm was great at collecting data, which Pewds says he’ll use to “create his own Palantir,” a project he teased for a future video. With this came the realization that smaller models are often more efficient; they’re fast and light, and, when combined with search and RAG, can punch well above their weight. Felix ended the video by reminding viewers that you don't need a beast PC to run AI models, and that he hopes to share his own soon for anyone to self-host.

Follow Tom's Hardware on Google News , or add us as a preferred source , to get our latest news, analysis, & reviews in your feeds.

Hassam Nasir Social Links Navigation Contributing Writer Hassam Nasir is a die-hard hardware enthusiast with years of experience as a tech editor and writer, focusing on detailed CPU comparisons and general hardware news. When he’s not working, you’ll find him bending tubes for his ever-evolving custom water-loop gaming rig or benchmarking the latest CPUs and GPUs just for fun.

circadia Pewds genuinely has improved as a person, huh… back in the day he was definitely really edgy and racist. Reply

Key considerations

  • Investor positioning can change fast
  • Volatility remains possible near catalysts
  • Macro rates and liquidity can dominate flows

Reference reading

More on this site

Informational only. No financial advice. Do your own research.

Leave a Comment