So, I’ve heard that ML models manipulate tokens, and for English corpora those roughly take the place of words. If we want the model to be polite and not use uncomfortable language, we could remove certain words, for example “fuck”, from the internal vocabulary where all the tokens and their associated data are stored.
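In practice this is usually done not by deleting vocabulary entries but by masking the banned tokens at sampling time. Here's a minimal sketch (the function name and toy vocabulary are mine, not from any real model): setting a token's logit to negative infinity gives it zero probability after the softmax, so it can never be sampled.

```python
import numpy as np

def sample_with_banned_tokens(logits, banned_ids, rng=None):
    """Sample one token id, with banned ids forced to zero probability."""
    rng = rng or np.random.default_rng()
    logits = np.array(logits, dtype=float)
    logits[banned_ids] = -np.inf           # banned tokens get probability 0
    probs = np.exp(logits - logits.max())  # numerically stable softmax
    probs /= probs.sum()
    return rng.choice(len(logits), p=probs)

# Toy 4-token vocabulary where id 2 is the "bad word":
token = sample_with_banned_tokens([1.0, 2.0, 5.0, 0.5], banned_ids=[2])
```

This mirrors the "logit bias" style of control some APIs expose, and it avoids retraining or rebuilding the tokenizer.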
ChatGPT’s sampling parameters are unknown, and it definitely doesn’t just pick the 3rd most likely token. More sophisticated sampling methods are almost certainly used, such as temperature, top-p, and top-k.
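For anyone curious what those three knobs actually do, here's a toy sketch (function name and default values are illustrative, not ChatGPT's actual settings): temperature rescales the logits, top-k keeps only the k most likely tokens, and top-p keeps the smallest set of tokens whose cumulative probability reaches p.

```python
import numpy as np

def sample_next_token(logits, temperature=0.8, top_k=50, top_p=0.95, rng=None):
    """Toy illustration of temperature, top-k, and top-p (nucleus) sampling."""
    rng = rng or np.random.default_rng()
    logits = np.array(logits, dtype=float) / temperature  # temperature scaling
    probs = np.exp(logits - logits.max())                 # stable softmax
    probs /= probs.sum()
    # top-k: zero out everything but the k most likely tokens
    if top_k < len(probs):
        kth_largest = np.sort(probs)[-top_k]
        probs[probs < kth_largest] = 0.0
        probs /= probs.sum()
    # top-p: keep the smallest high-probability set whose mass >= top_p
    order = np.argsort(probs)[::-1]
    cumulative = np.cumsum(probs[order])
    cutoff = np.searchsorted(cumulative, top_p) + 1
    probs[order[cutoff:]] = 0.0
    probs /= probs.sum()
    return rng.choice(len(probs), p=probs)
```

With top_k=1 this collapses to greedy decoding (always the most likely token); larger temperature flattens the distribution and makes rarer tokens more likely.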
Correct, but also well above the level of the average reader
I probably should have used an example other than ChatGPT tbh