It’s a good thing that real open source models are getting good enough to compete with or exceed OpenAI.
I like the game, but agree with the over-tutorialed complaints. They have two difficulty modes, I wish only story mode got all the handholding. I think there’s enough obvious indicators to get you through all the game mechanics.
It has been on my list to figure out how to move to forgejo, need to do it soon before the migration process breaks or gets awful.
surely he’ll be less of a twat then. right?
Donnie Darko - Just such a great, strange movie
I guess it wasn’t bacon I hate for breakfast yesterday.
Why do you hate bacon, are you a windmill?
Things I will bet money on
It’s not a cinematic masterpiece, but it had a distinctive look and vibe, a cool soundtrack, and an interestingly strange plot. I saw it again a few years ago and remembered why I liked it as an angsty teen.
Really love arch and the AUR. I’ve been tempted to get nix set up for the rare cases when there’s no AUR package or the AUR package is unmaintained. I figure if there’s no package in the AUR or nixpkgs, it’s probably not worth running.
I’m a Unity noob and even more of a noob in Godot, but the C# development experience is so much better in Godot it’s ridiculous.
I remember when, what was it, like 6 years ago, Unity announced moving towards .NET Core. I can appreciate that’s a large effort, but they’ve made ridiculously little progress that I can see.
btop reports some gpu, network and disk information that I don’t think shows up in htop, feels a bit more comprehensive maybe? Both are fine, but I too use btop, it’s nice.
Random trivia: I think btop has been rewritten like 3-5 times now? It’s sort of an inside joke to the point that someone suggested another rewrite from C++ to Rust ( https://github.com/aristocratos/btop/issues/5 ). I guess the guy just likes writing system monitoring console apps.
Easiest shorting money I ever made.
It’s not uncommon on sensitive stories like this for the government to loop in journalists ahead of time so they can pull together background and research, with an agreed-upon embargo until some point in the future.
This wasn’t the US government telling the newspaper they couldn’t report on a story they had uncovered from their own investigation.
I guess this solves part of the mystery about why the French rioted when they raised the retirement age last year
There’s quantization, which basically compresses the model to use a smaller data type for each weight. It reduces memory requirements by half or even more.
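A rough sketch of the idea in plain numpy, not any particular library’s implementation (real quantizers like GPTQ, AWQ, or llama.cpp’s k-quants are a lot more sophisticated):

```python
import numpy as np

# Toy symmetric int8 quantization of a single weight matrix.
weights = np.random.randn(4096, 4096).astype(np.float32)  # ~64 MB at fp32

scale = np.abs(weights).max() / 127.0                      # one scale for the whole tensor
q_weights = np.round(weights / scale).astype(np.int8)      # ~16 MB, 4x smaller

# At inference time the weights get dequantized (or used directly in int8 kernels).
restored = q_weights.astype(np.float32) * scale
print("worst-case error:", np.abs(weights - restored).max())
```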
There’s also airllm, which loads a part of the model into RAM, runs those calculations, unloads that part, loads the next part, and so on. It’s a nice option, but the performance of all that loading/unloading is never going to be great, especially on a huge model like llama 405b.
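The basic trick looks roughly like this. To be clear, this is just the concept, not airllm’s actual API, and load_layer here is a made-up placeholder for “read one layer’s weights off disk”:

```python
import gc

def generate_streamed(hidden_states, num_layers, load_layer):
    # Only one transformer layer's weights are ever in memory at a time.
    for i in range(num_layers):
        layer = load_layer(i)              # pull layer i's weights from disk
        hidden_states = layer(hidden_states)
        del layer                          # drop the weights before loading the next layer
        gc.collect()
    return hidden_states
```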
Then there are some neat projects to distribute models across multiple computers like exo and petals. They’re more targeted at a p2p-style random collection of computers. I’ve run petals in a small cluster and it works reasonably well.
Is this the new “Simpsons already did it”?
Cunk already did it…
(3:40 if you want to get right to it) https://www.youtube.com/watch?v=UoSUx1xyj1E
First, a caveat/warning: you’ll need a beefy GPU to run larger models, though there are some smaller models that perform pretty well.
Adding a medium amount of extra information for you or anyone else who might want to get into running models locally.
Tools
Models
If you look at https://ollama.com/library?sort=featured you can see models
Model size is measured by parameter count. Generally, higher-parameter models are better (more “smart”, more accurate), but it’s very challenging/slow to run anything over 25b parameters on consumer GPUs. I tend to find 8-13b parameter models are a sort of sweet spot. The 1-4b parameter models are meant more for really low-power devices; they’ll give you OK results for simple requests and summarizing, but they’re not going to wow you.
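A rough rule of thumb I use (just an approximation, ignoring context/KV-cache overhead) is parameter count times bytes per weight; the bytes-per-weight part is where the quantization covered just below comes in:

```python
def approx_gb(params_billion, bits_per_weight):
    # 1 billion params at 1 byte each is roughly 1 GB
    return params_billion * bits_per_weight / 8

print(approx_gb(8, 16))   # 8b at fp16 -> ~16 GB
print(approx_gb(8, 8))    # 8b at q8   -> ~8 GB
print(approx_gb(13, 4))   # 13b at q4  -> ~6.5 GB
print(approx_gb(70, 4))   # 70b at q4  -> ~35 GB, out of reach for most consumer GPUs
```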
If you look at the ‘tags’ for the models listed below, you’ll see things like 8b-instruct-q8_0 or 8b-instruct-q4_0. The q part refers to quantization, i.e. shrinking/compressing a model, and the number after it is roughly how aggressively it was compressed. Note the size of each tag and how the size drops as the quantization gets more aggressive (smaller numbers). You can roughly think of this size number as “how much video RAM do I need to run this model”. For me, I try to aim for q8 models, or fp16 if they can fit in my GPU. I wouldn’t try to use anything below q4 quantization; there seems to be a lot of quality loss below q4. Models can run partially or even fully on a CPU, but that’s much slower. Ollama doesn’t yet support the new NPUs found in recent laptops/processors, but work is happening there.
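Once a model is pulled, talking to it is just an HTTP call to the local ollama server. A minimal sketch using only the standard library; the model tag here is just an example, swap in whatever you actually pulled:

```python
# Assumes the ollama server is running locally (it listens on port 11434 by default)
# and that you've already pulled a model tag, e.g. one of the q8_0 tags above.
import json
import urllib.request

payload = {
    "model": "llama3.1:8b-instruct-q8_0",   # example tag, use whatever you pulled
    "prompt": "In one paragraph, what does q4 vs q8 quantization trade off?",
    "stream": False,
}

req = urllib.request.Request(
    "http://localhost:11434/api/generate",
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["response"])
```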