[Help] Trying to run a local storytelling model with KoboldCpp

posted 1 year ago

Hi,

Just like the title says:

I’m trying to run:

With:

Running :

--stream --unbantokens --threads 8 --usecublas normal

I get very limited output with lots of repetition.

I mostly didn’t touch the default settings:

Does anyone know how I can make things run better?

EDIT: Sorry for multiple posts, Fediverse bugged out.


3 points

1 year ago

I’m not familiar with KoboldCpp, but it looks like you may have “Amount to Gen” set very low. Try increasing it.
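If it helps to experiment outside the UI, here is a rough sketch of setting the same thing through the HTTP API. It assumes KoboldCpp is serving its KoboldAI-compatible endpoint on the default local port 5001, and that `max_length` is the API counterpart of “Amount to Gen”. The helper names and the numeric values are made up for illustration, not recommended settings.

```python
import json
import urllib.request

# Assumption: KoboldCpp is running locally on its default port.
API_URL = "http://localhost:5001/api/v1/generate"


def build_payload(prompt, max_length=300, rep_pen=1.1, temperature=0.7):
    """Build the request body. Values are illustrative, not tuned."""
    return {
        "prompt": prompt,
        "max_length": max_length,    # tokens to generate per request
        "rep_pen": rep_pen,          # repetition penalty; higher discourages repeats
        "temperature": temperature,  # sampling randomness
    }


def generate(prompt, **settings):
    """POST the payload and return the generated text (needs a running server)."""
    body = json.dumps(build_payload(prompt, **settings)).encode("utf-8")
    request = urllib.request.Request(
        API_URL,
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["results"][0]["text"]


if __name__ == "__main__":
    # Ask for a longer continuation than a very low default would give.
    print(generate("Once upon a time,", max_length=400))
```

Raising `max_length` should help with the short output. If the repetition persists, nudging `rep_pen` up slightly is the other setting I would try first.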

