I’d like to self-host a large language model (LLM).

I don’t mind if I need a GPU and all that; at least it will be running on my own hardware, and it will probably even be cheaper than the $20/month everyone is charging.

What LLMs are you self-hosting? And what are you using to do it?

  • Showroom7561@lemmy.ca · 9 days ago

    You can run this right from Windows: https://jan.ai/

    You’ll need a lot of RAM, and processing is decently fast, even on a basic laptop.

    edit: holy hell. Grammar.
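To give a feel for why “a lot of RAM” is needed, here is a back-of-the-envelope sketch. The model size (7B parameters) and 4-bit quantization are assumptions for illustration, not figures from this thread:

```python
# Rough RAM estimate for a locally hosted quantized LLM.
# All figures below are assumptions, not values from the thread.
params_billion = 7    # assumed model size: 7 billion parameters
bits_per_weight = 4   # assumed 4-bit quantization
overhead_factor = 1.2 # rough ~20% extra for KV cache and runtime buffers

weights_gb = params_billion * 1e9 * bits_per_weight / 8 / 1e9
total_gb = weights_gb * overhead_factor
print(f"approx. {total_gb:.1f} GB of memory needed")
```

By this estimate a 4-bit 7B model wants roughly 4 GB free; larger models or higher-precision quantizations scale the figure up proportionally.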

    • dangling_cat@lemmy.blahaj.zone · 8 days ago

      Tip: you can copy and paste the Hugging Face link directly into the search box, and it will download the model automatically! Also, it’s pretty smart. It will load into your VRAM first, then your RAM. If you can fit everything into VRAM, you get the fastest speed. But even if you are using RAM, it’s not terribly bad; it’s still faster than you can read.
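The VRAM-first placement described above can be sketched as simple arithmetic: fit as many transformer layers as possible into VRAM, and spill the rest to system RAM. The layer count, per-layer size, and VRAM figures here are illustrative assumptions, not measurements from any particular model:

```python
# Sketch of VRAM-first layer placement, as described above.
# n_layers, layer_gb, and vram_gb are assumed example figures.
def split_layers(n_layers, layer_gb, vram_gb):
    """Return (layers_in_vram, layers_in_ram) for a greedy VRAM-first split."""
    in_vram = min(n_layers, int(vram_gb // layer_gb))
    return in_vram, n_layers - in_vram

# e.g. a hypothetical 32-layer model at ~0.13 GB/layer on an 4 GB GPU:
gpu_layers, ram_layers = split_layers(n_layers=32, layer_gb=0.13, vram_gb=4.0)
print(f"{gpu_layers} layers in VRAM, {ram_layers} in RAM")
```

The more layers land in VRAM, the faster generation runs; anything spilled to RAM still works, just at the slower CPU-side speed the comment describes.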