[Help] Trying to run a local Story telling model with KoboldCpp

posted 1 year ago by darkeox@kbin.social in localllama@sh.itjust.works (16 comments)

Hi,

Just like the title says:

I’m trying to run:

https://huggingface.co/TheBloke/WizardLM-Uncensored-SuperCOT-StoryTelling-30B-SuperHOT-8K-GGML

With:

koboldcpp:v1.43 using HIPBLAS on a 7900XTX / Arch Linux

Launched with these flags:

--stream --unbantokens --threads 8 --usecublas normal

I get very limited output with lots of repetition.

I mostly didn’t touch the default settings:

Does anyone know how I can make things run better?

EDIT: Sorry for multiple posts, Fediverse bugged out.

darkeox@kbin.social (OP), 2 points, 1 year ago

Alright, thanks for the info & additional pointers.
