You are viewing a single thread.
View all comments 53 points
*
The quantized model you can run locally works decently and they can’t read any of it, which is nice.
I use that one specifically https://huggingface.co/lmstudio-community/Meta-Llama-3-8B-Instruct-GGUF/blob/main/Meta-Llama-3-8B-Instruct-Q4_K_M.gguf
If you’re looking for a relatively user-friendly software to use it, you can look at GPT4All (open source) or LM Studio.
12 points
8 points
16 points
If you’re ready to tinker a bit i can recommend Ollama for the backend and Open web UI for the frontend. They can also both run on the same machine.
The advantage is that you can use your GPU to compute, which is a lot faster.