this post was submitted on 02 Feb 2025
231 points (97.1% liked)

United States | News & Politics

BreadstickNinja@lemmy.world 1 points 1 day ago

There are finetunes of Llama, Qwen, etc., distilled from DeepSeek that implement the same pre-response thinking logic, but they are ultimately still the smaller models with some tuning. If you want to run locally and don't have tens of thousands of dollars to throw at datacenter-scale GPUs, those are your best option, but they won't match what you'd get in the DeepSeek app.
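
For anyone wondering what the "pre-response thinking" looks like in practice when you run one of these locally: the model writes out its reasoning first and then the actual answer. Here's a rough sketch of separating the two, assuming the reasoning comes wrapped in `<think>...</think>` tags the way the R1-style distills emit it. `split_thinking` is just a name I made up for illustration, not part of any library, and the sample string is hand-written rather than real model output.

```python
import re

# Assumption: reasoning is wrapped in <think>...</think> ahead of the answer.
# If your model uses a different delimiter, adjust the pattern.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_thinking(raw: str) -> tuple[str, str]:
    """Return (reasoning, answer) from a raw completion string.

    Hypothetical helper: if no <think> block is found, the reasoning
    is empty and the whole text is treated as the answer.
    """
    match = THINK_RE.search(raw)
    if match is None:
        return "", raw.strip()
    reasoning = match.group(1).strip()
    # Everything outside the first <think> block is the visible answer.
    answer = (raw[:match.start()] + raw[match.end():]).strip()
    return reasoning, answer


# Hand-written sample, not real model output.
sample = "<think>2 + 2 is basic arithmetic.</think>\nThe answer is 4."
reasoning, answer = split_thinking(sample)
print("reasoning:", reasoning)
print("answer:", answer)
```

Whichever local runner you use, this only changes how you display the text. The quality of the reasoning itself still depends on the smaller model underneath.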