How to run LLaMA (and other LLMs) on Android.

llama@lemmy.dbzer0.com · edit-2 13 hours ago

How to run LLaMA (and other LLMs) on Android.

Rhaedas@fedia.io · 1 day ago

I’ve run a local LLM on my PC for a while, so I’m familiar enough with Ollama to understand what’s going on. I’ve tried this with my Samsung Tracfone, not really expecting a lot. Surprisingly I’ve gotten all the way to getting a prompt, but then things crash and I’m kicked back to the starting terminal. Pretty sure it’s memory, so I’m now trying to use virtual memory to bump it up to the 4GB you’ve had success with (the phone looks to have 3GB actual memory, plenty of storage though).

If it doesn’t work, I’ll try some of the others, perhaps they’re a bit smaller.

I did get the 0.5 Qwen to run well. I’m surprised how fast it is even using CPU mode, but maybe being smaller also helps with the processing.

Just a tip (maybe obvious to experienced users): while you do have to run the terminal, login to debian, start the server and then run the model, remember that you can use the arrow keys in the terminal to repeat past commands, so it’s pretty quick to do. I actually missed the arrow keys the first time around because they aren’t very distinct or highlighted, but then when I had to look for how to do CTRL, I realized they were right in front of me.

llama@lemmy.dbzer0.com · 23 hours ago

I have tried on more or less 5 spare phones. None of them have less than 4 GB of RAM, however.

How to run LLaMA (and other LLMs) on Android.

How to run LLaMA (and other LLMs) on Android.

Step 1: Install Termux

Step 2: Set Up proot-distro and Install Debian

Step 3: Install Dependencies

Step 4: Install Ollama

Step 5: Download and run the Llama3.2:1B Model