

I’m mostly trying to describe a feeling I don’t hear named very often


I’ll give that a shot.
I’m running it in docker because it’s running on a headless server with a boatload of other services. Ideally whatever I use will be accessible over the network.
I think at the time I started, not everything supported Intel cards, but it looks like llama-cli has support for Intel GPUs. I’ll give it a shot. Thanks!
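For anyone following along, here’s roughly the shape of what I’m planning to try. This is a sketch only; I haven’t verified the image tag, device flags, or paths, so treat all of them as assumptions:

```sh
# Hypothetical sketch: llama.cpp's llama-server with the Intel SYCL backend,
# exposed over the network. The image tag, device mapping, and model path
# are assumptions on my part -- check the llama.cpp docs before copying.
# --device /dev/dri passes the Intel GPU through to the container;
# --host 0.0.0.0 makes the API reachable from other machines on the LAN.
docker run -d --name llama-server \
  --device /dev/dri \
  -v /srv/models:/models \
  -p 8080:8080 \
  ghcr.io/ggml-org/llama.cpp:server-intel \
  -m /models/some-model.gguf -ngl 99 --host 0.0.0.0 --port 8080
```

The appeal is that llama-server speaks an OpenAI-compatible API, so the other containers on the box could just point at it.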
Not to mention, if you try to swallow a potato whole (as one does), you risk choking to death.


Thanks for the link. I was gonna ask if you were a writer, heh.
I agree. The tone of the ads this year felt almost like lampshading: if we acknowledge the problem, we look wise to what the audience is feeling, but we’re not going to do a damn thing to address it. It’s just something that needs to be done to make the ad feel remotely relevant.
AI is scary, but don’t be afraid of our surveillance device because we acknowledged that AI is scary
AI will sell you ads. Anyway, you’re watching an ad for AI
Work sucks amirite? Why not let us unemploy you?
There’s a wealth gap. Spend money on our stuff.
And I’m not going to even link the He Gets Us ads.
It was an especially interesting case because there was a question of whether the photographer lied about who actually took the picture. He could either claim the monkey took it and lose the copyright, or claim he took it himself and have it lose all its value.


Thanks for taking the time.
So I’m not using a CLI. I’ve got the intelanalytics/ipex-llm-inference-cpp-xpu image running and hosting LLMs to be used by a separate open-webui container. I originally set it up with Deepseek-R1:latest per the tutorial to get the results above. This was straight out of the box with no tweaks.
The interface offers some control settings (screenshot below). Is that what you’re talking about?
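For reference, my understanding is that those sliders just map onto the backend’s sampling options, so the same thing should be settable directly against the API. Here’s a sketch assuming the container exposes the standard Ollama endpoint on its default port 11434 and that DeepSeek’s commonly suggested sampling values apply; I haven’t confirmed either for this particular image:

```sh
# Hypothetical: setting sampling options directly against the Ollama API
# instead of through the Open WebUI sliders. Port, model tag, and the
# specific option values are assumptions.
curl http://localhost:11434/api/generate -d '{
  "model": "deepseek-r1:8b",
  "prompt": "Summarize the plot of Hamlet in two sentences.",
  "stream": false,
  "options": {
    "temperature": 0.6,
    "top_p": 0.95,
    "num_ctx": 8192
  }
}'
```

If tweaking those changes your results, then yes, that’s the kind of knob I meant.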

Those aren’t for sharpening. They’re for honing the blade. A sharp edge is thin enough to get bent out of shape during normal use, so the honing rod serves to straighten the edge, not sharpen it.
Mouse? I thought that was a koala all this time.



Well, not off to a great start.
To be clear, I think getting an LLM to run locally at all is super cool, but saying “go self hosted” sort of glosses over the fact that getting a local LLM to do anything close to what ChatGPT can do is a very expensive hobby.
Don’t worry. Her husband died in like 1985, and she lived off his government pension for 35 years.
My incredibly racist late grandmother asked me in 2020 if I was voting for “the other guy.”


Any suggestions on how to get these into GGUF format? I found a GitHub project that claims to convert them, but I’m wondering if there’s a more direct way.
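In case it helps anyone else: the most direct route I’m aware of is llama.cpp’s own bundled converter script plus its quantize tool, rather than a third-party project. A sketch, assuming a Hugging Face–format model directory; the paths and the Q4_K_M choice are just placeholders:

```sh
# Sketch: convert a Hugging Face model directory to GGUF with llama.cpp's
# bundled script, then optionally quantize it down for a small GPU.
git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp
pip install -r requirements.txt

# produces an f16 GGUF from the HF safetensors/config
python convert_hf_to_gguf.py /path/to/hf-model \
  --outfile model-f16.gguf --outtype f16

# optional: shrink to ~4-bit so it fits on an 8 GB card
# (llama-quantize is built as part of llama.cpp; see its README)
./llama-quantize model-f16.gguf model-q4_k_m.gguf Q4_K_M
```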


Go self-hosted,
So your comment and another one I saw today got me to dust off an old docker container I was playing with a few months ago to run deepseek-r1:8b on my server’s Intel A750 GPU with 8 GB of VRAM. Not exactly top-of-the-line, but not bad.
I knew it would be slow and not as good as ChatGPT or whatever, which I guess I can live with. I did ask it to write some example Rust code today, which I hadn’t even thought to try, and it worked.
But I also asked it to describe the characters in a popular TV show, and it got a ton of details wrong.
8B is the highest parameter count I can run on my card. How do you propose someone in my situation run an LLM locally? Can you suggest some better models?
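For context on why 8B is the ceiling: at 4-bit quantization the weights alone come to roughly 8 billion parameters × 0.5 bytes ≈ 4 GB, and the KV cache plus runtime overhead add another 1–2 GB depending on context length, so it just fits in 8 GB. A 14B model at the same quant would need around 7 GB for weights before any cache, which is why it won’t load. Those are back-of-envelope numbers, not measurements.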
It was.
So was the episode of Silicon Valley that the above image is from. https://www.cracked.com/article_49720_kid-rocks-cameo-on-silicon-valley-is-going-viral-for-obvious-reasons.html
Nope. Mirrors show you what you looked like when you were 3-4 nanoseconds younger.


That’s not even close to the worst of it.
But a cattery couldn’t be used in a circuit. It only has a cathode.
Are you aware of all the parallels between TMNT and Daredevil?