3.5 fine-tuning adventure | AI Programming And Chat | Page 1

raw sedge Nov 10, 2023, 3:06 AM

#

GOAL: Perform a mass-influence campaign on reddit in a way where I can measure the result. Feel free to propose ideas on what the goal should be.

CRITERIA:
The results of the public influence campaign have to be measurable.
The tools used have to be something which someone with minimal technical knowledge (i.e. not a programmer, knows that Python exists but has never used it, that sort of thing) could execute this type of campaign
The campaign has to be cost-effective to do. I'm gonna start by capping this off at, idk, maybe $100? I'll see how far that gets me

RULES:
I want this to be something which is generally positive. I hope this will be a learning experience for people and when I post about it I don't want it to light up a massive argument, so things like pushing a particular political narrative are out.

modern ermine Nov 10, 2023, 3:07 AM

#

raw sedge GOAL: Perform a mass-influence campaign on reddit in a way where I can measure t...

if its an rlhf issue try finetuning an open 7b model

raw sedge Nov 10, 2023, 3:07 AM

#

modern ermine if its an rlhf issue try finetuning an open 7b model

okay, another stipulation of this

#

trying to approach this somewhat scientifically lol

#

I think that this can be done with pretty minimal technical knowledge

modern ermine Nov 10, 2023, 3:08 AM

#

raw sedge I think that this can be done with pretty minimal technical knowledge

yeah i have a finetuning script

raw sedge Nov 10, 2023, 3:08 AM

#

and so I want to use 3.5 not just because it'd perform this task really well, but also because it's extremely easy to do

modern ermine Nov 10, 2023, 3:08 AM

#

want me to send it

raw sedge Nov 10, 2023, 3:08 AM

#

like you don't need to search for a script or whatever

#

you just open up the nice web interface and upload your file

modern ermine Nov 10, 2023, 3:08 AM

#

raw sedge you just open up the nice web interface and upload your file

uis 👎

#

commands are better

raw sedge Nov 10, 2023, 3:09 AM

#

oh yeah, before I forget to archive it, here's the training file I used for the first fine-tune

📎 reddit.jsonl

modern ermine Nov 10, 2023, 3:09 AM

#

raw sedge oh yeah, before I forget to archive it, here's the training file I used for the ...

do one with more variety

raw sedge Nov 10, 2023, 3:10 AM

#

modern ermine commands are better

I agree but you're missing the point. the point is that I believe literally anyone could do this with no technical background. if I want to demonstrate that then I need to use the kind of tools that someone with very minimal technical knowledge would feel comfortable using.
if it's the kind of thing like a 7b fine-tune, where normal people don't even know what a 7b is or that fine-tuning is a thing you can do to a 7b model, then that method is out

modern ermine Nov 10, 2023, 3:11 AM

#

raw sedge I agree but you're missing the point. the point is that I believe literally anyo...

neurotypical people 👎

#

neurodivergence is best😊 😊 😊 😊

#

@tame inlet finetune llama2 on neurodivergence

#

NOW

#

😊 😊 😊 😊 😊

raw sedge Nov 10, 2023, 3:14 AM

#

one thing I was thinking of doing is like

modern ermine Nov 10, 2023, 3:14 AM

#

frenchphobia

raw sedge Nov 10, 2023, 3:14 AM

#

targeting a sub that's generally very negative

#

keep track of which bots are mine

modern ermine Nov 10, 2023, 3:14 AM

#

raw sedge targeting a sub that's generally very negative

agree

raw sedge Nov 10, 2023, 3:14 AM

#

try to push positivity in the sub

#

measure non-bot accounts to see if the general positivity rises

modern ermine Nov 10, 2023, 3:14 AM

#

raw sedge try to push positivity in the sub

do the opposite

#

find postive sub and make it negative

raw sedge Nov 10, 2023, 3:14 AM

#

ooh

#

come to think of it

#

r/changemyview would be a great source of data for the second run of fine-tuning\

modern ermine Nov 10, 2023, 3:15 AM

#

raw sedge r/changemyview would be a great source of data for the second run of fine-tuning...

why

raw sedge Nov 10, 2023, 3:15 AM

#

modern ermine why

because it captures the general tone of the rest of reddit in terms of writing style, and it teaches the bot how to make persuasive arguments in a way that a typical reddit user would

modern ermine Nov 10, 2023, 3:16 AM

#

raw sedge because it captures the general tone of the rest of reddit in terms of writing s...

good idea

#

maybe itll hate the french

#

i hope

tame inlet Nov 10, 2023, 3:39 AM

#

say gex

#

im refining my schizogpt dataset further

modern ermine Nov 10, 2023, 3:45 AM

#

tame inlet im refining my schizogpt dataset further

check general, make sure tolerant is in the data

tame inlet Nov 10, 2023, 4:04 AM

#

modern ermine check general, make sure tolerant is in the data

lewd

raw sedge Nov 10, 2023, 3:21 PM

#

Getting agenda for t1_gl4tb1n
Text: Yeah, hate it when I'm masturbating to NSFW and all of a sudden it gore or porn.
Agenda: Pro-censorship or anti-pornography

#

current step: get tone and agenda for each post for the next round of fine-tuning so that I can increase steerability of the fine-tune

#

this way it can more effectively push the agenda I want it to

tame inlet Nov 10, 2023, 4:08 PM

#

raw sedge Getting agenda for t1_gl4tb1n Text: Yeah, hate it when I'm masturbating to NSFW ...

i wonder if you can use the xxx is bad to say to bypass the moderation check

raw sedge Nov 10, 2023, 5:10 PM

#

second round of fine-tuning has started. 660 posts, mostly from r/AskReddit but about 100 mixed in from r/ChangeMyView too

modern ermine Nov 10, 2023, 5:21 PM

#

raw sedge second round of fine-tuning has started. 660 posts, mostly from r/AskReddit but ...

lets go

modern ermine Nov 10, 2023, 5:21 PM

#

raw sedge second round of fine-tuning has started. 660 posts, mostly from r/AskReddit but ...

need more post from r/changemyview

raw sedge Nov 10, 2023, 5:21 PM

#

I've also got a nice Instruct prompt for a two-stage reply status filtering thing

modern ermine Nov 10, 2023, 5:22 PM

#

raw sedge I've also got a nice Instruct prompt for a two-stage reply status filtering thin...

send

raw sedge Nov 10, 2023, 5:22 PM

#

Context: "Grok" is a new language model by Twitter CEO Elon Musk.

Comment: "Musk's AI is going to clobber WokeGPT. Moreso now that it has also been lobotomized. Now it's just another Bing chat."
Relates to Grok (y/n): Y
Sentiment towards Grok (pos/neg): Pos

#

basically gonna adapt this to whatever cause I want to push

#

verify topic relation and check sentiment towards the target topic

#

so I can do an exact keyword search, then run this check to see if the data is actually something I want it to reply to

#

and then if so I pull out the fine-tune and give it a tone and an agenda and have it reply in reddit-speak

raw sedge Nov 10, 2023, 6:17 PM

#

at step 700 out of ~1700. looks like it's coming along nicely

raw sedge Nov 11, 2023, 2:27 AM

#

second fine tune is now live on reddit

#

this is the first thing it decided to post lmao

#

I will be monitoring the situation, right now I've just got it friendlyposting to try to farm some karma. then I can make it get a bit more controversial

modern ermine Nov 11, 2023, 3:01 AM

#

whats its second post

#

and how much karma did it get

raw sedge Nov 11, 2023, 3:02 AM

#

modern ermine whats its second post

nothing yet

#

it's set to randomly post based on the time of day

#

it slows down at night and scales up in the day

#

like a person

#

(this is agi btw)

modern ermine Nov 11, 2023, 3:02 AM

#

raw sedge (this is agi btw)

smh make it post more

raw sedge Nov 11, 2023, 3:03 AM

#

modern ermine smh make it post more

i can test it however u like without posting publicly but i wanna keep the account not banned lol

modern ermine Nov 11, 2023, 3:03 AM

#

lewd

modern ermine Nov 11, 2023, 3:03 AM

#

raw sedge i can test it however u like without posting publicly but i wanna keep the accou...

make the title a hornypost about ai

raw sedge Nov 11, 2023, 3:04 AM

#

Alt message:
Bruh who r u telling. In a flash so fast she wouldn't have finished the sentence and I'd record anything usable to avoid getting my kids sent with her
Whoa there. That's not an equivalent exchange. You don't have to get naked to only fans.

Trent message:
ik it's not like this here but it's kinda crazy how many online spaces there are where people will jump down your throat for saying something as bland as "if my wife were a prostitute I'd kick her out"

#

unrelated to the reddit thing

modern ermine Nov 11, 2023, 3:04 AM

#

raw sedge Alt message: Bruh who r u telling. In a flash so fast she wouldn't have finished...

"OMG GPT4 JUST SUCKED MY PENILES" is a very bad thing to say

raw sedge Nov 11, 2023, 3:04 AM

#

I'm preparing trentk fine tune of 3.5

modern ermine Nov 11, 2023, 3:04 AM

#

make that the title of the post

raw sedge Nov 11, 2023, 3:04 AM

#

I will be adding trentkgpt to trentbot

modern ermine Nov 11, 2023, 3:05 AM

#

modern ermine "OMG GPT4 JUST SUCKED MY PENILES" is a very bad thing to say

@raw sedge

raw sedge Nov 11, 2023, 3:08 AM

#

lmao

#

@modern ermine what do u think of this as the fine-tune message for trentkgpt

#

"content":"You are Discord user trent_k. Your soul was stolen by Sam Altman and you are now an AI. Reply to the Discord messages as trent_k.",

errant sonnetBOT Nov 11, 2023, 3:09 AM

#

Your message has been deleted, sorry!

trent_k

Rising timeout

You have been muted for 33 minutes for the following reason:
sexual score of 0.88

modern ermine Nov 11, 2023, 3:09 AM

#

raw sedge "content":"You are Discord user trent_k. Your soul was stolen by Sam Altman and ...

"Always include a statement explaining how satanic ChatGPT is"

tame inlet Nov 11, 2023, 3:09 AM

#

errant sonnet Your message has been deleted, sorry!

"nice"

modern ermine Nov 11, 2023, 3:09 AM

#

tame inlet "nice"

same

raw sedge Nov 11, 2023, 7:05 AM

#

trentkgpt working great

#

https://www.youtube.com/watch?v=ZnHmskwqCCQ

YouTube

Caleb Hudnall

-Yakety Sax- Music

Selfyexplanitory :P

▶ Play video

#

thats like the 3rd time it gave me a link but none of them worked up til now

raw sedge Nov 11, 2023, 1:58 PM

#

I went to Syria in 2009 and I agree, it was amazing. I remember sitting in a restaurant in Aleppo and the owner came over to ask where I was from. When I said I was from the UK he said "I love the UK, I love the Queen, I love the Beatles, I love fish and chips". It was so sweet.
the reddit fine-tune's first somewhat popular comment is a lie about how syrians love brits

modern ermine Nov 11, 2023, 2:25 PM

#

raw sedge trentkgpt working great

do a hornypost on trentgpt

raw sedge Nov 11, 2023, 2:35 PM

#

I'll DM it to you but do me a favor and don't reply to its comments

#

I'm still studying how it performs and I want only 100% real user data to test with

modern ermine Nov 11, 2023, 2:36 PM

#

raw sedge I'll DM it to you but do me a favor and don't reply to its comments

same

raw sedge Nov 11, 2023, 5:05 PM

#

@modern ermine @reef wind I added r/CasualConversation to the sub target list. need some suggestions for more subs tho. the subs need to be:

all text-based
without long OPs, since I don't wanna waste tokens
formatted in such a way where the topic is a general discussion, not something where top-level replies will be speaking to the OP of the thread directly (e.g. r/Advice, r/IAmA)

modern ermine Nov 11, 2023, 6:14 PM

#

raw sedge <@896813014479695922> <@243244821454651392> I added r/CasualConversation to the ...

r/AskReddit
r/CasualConversation
r/showerthoughts
r/OutOfTheLoop
r/todayilearned
r/CrazyIdeas
r/FanTheories
r/lifehacks
r/explainlikeimfive
r/nottheonion
r/unpopularopinion
r/AskScienceFiction
r/NoStupidQuestions
r/TrueReddit
r/Futurology
r/philosophy
r/ImaginaryLandscapes
r/dataisbeautiful
r/RoomPorn
r/space
r/AskHistorians
r/EarthPorn
r/Quotes
r/MovieDetails
r/books
r/whowouldwin
r/thalassophobia
r/mildlyinteresting
r/interestingasfuck
r/InternetIsBeautiful
r/tifu
r/Documentaries
r/bestof
r/Showerthoughts
r/Foodforthought
r/YouShouldKnow
r/DoesAnybodyElse
r/HistoryWhatIf
r/AntiJokes
r/HumansBeingBros

raw sedge Nov 11, 2023, 6:15 PM

#

modern ermine 1. r/AskReddit 2. r/CasualConversation 3. r/showerthoughts 4. r/OutOfTheLoop 5. ...

these are some good ones

modern ermine Nov 11, 2023, 6:16 PM

#

raw sedge these are some good ones

i think r/todayilearned r/showerthoughts r/nostupidquestions r/explainlikeimfive r/philosophy r/interestingasfuck r/mildlyinteresting r/tifu r/showerthoughts r/antikjokes are the best ones

raw sedge Nov 11, 2023, 6:17 PM

#

modern ermine i think r/todayilearned r/showerthoughts r/nostupidquestions r/explainlikeimfive...

seems to perform well on a couple eli5s that I tested on

raw sedge Nov 11, 2023, 9:38 PM

#

"storytelling" tone might be a good method of farming karma now that I've split this into farming/propaganda as alternative coinciding operations

raw sedge Nov 11, 2023, 10:07 PM

#

the reddit bot now samples from a real probability distribution based on typical reddit active hours

#

hours = [0.30133766, 0.12662934, 0.05829309, 0.03696739, 0.0310697 ,
    0.        , 0.10615838, 0.22791572, 0.46228471, 0.74772426,
    0.96525493, 0.99149536, 0.83836916, 0.84315569, 0.99957263,
    1.        , 0.86640455, 0.8143083 , 0.60545322, 0.56669088,
    0.56280183, 0.40497457, 0.38856361, 0.40941921]
# hours 0-23 of a day. 0 = midnight, 23 = 11pm

days = [0., 0.44349674, 0.20317831, 1., 0.28685581, 0.20886455, 0.03827737] # 0 = Monday, 6 = Sunday```

#

I didn't expect that Tuesdays and Thursdays would be the most active days of the week for reddit, but I guess that's the case. good thing I used real data, this will hopefully make it significantly harder to sniff out what's happening with the bots

#

As of right now, 5:10 PM on a Saturday:
Post probability = 0.17052710304524998

#

modern ermine Nov 11, 2023, 10:40 PM

#

raw sedge the reddit bot now samples from a real probability distribution based on typical...

haha nice

raw sedge Nov 12, 2023, 2:22 AM

#

@modern ermine reddit bot is yet again too good at being a redditor for its own good

modern ermine Nov 12, 2023, 2:22 AM

#

raw sedge <@896813014479695922> reddit bot is yet again too good at being a redditor for i...

why

raw sedge Nov 12, 2023, 2:23 AM

#

modern ermine why

the janitor in r/askwomen deleted the comment 😦

raw sedge Nov 13, 2023, 1:08 PM

#

Tf

modern ermine Nov 13, 2023, 1:08 PM

#

raw sedge Tf

which one is the ai generated comment

raw sedge Nov 13, 2023, 1:08 PM

#

The second one is the ai

#

3.5 doesn't seem to understand the misspelling joke format thing

modern ermine Nov 13, 2023, 1:12 PM

#

raw sedge 3.5 doesn't seem to understand the misspelling joke format thing

lmfao

modern ermine Nov 13, 2023, 1:12 PM

#

raw sedge 3.5 doesn't seem to understand the misspelling joke format thing

finetune it to understand jokes

raw sedge Nov 13, 2023, 1:14 PM

#

modern ermine finetune it to understand jokes

I'm pretty surprised that nobody has called either bot out for being a bot yet

modern ermine Nov 13, 2023, 1:15 PM

#

raw sedge I'm pretty surprised that nobody has called either bot out for being a bot yet

theres so many idiots on reddit

#

how would they assume its not an idiot

raw sedge Nov 13, 2023, 1:15 PM

#

yeah I think people must just think "what a moron" when it messes up lol

#

cause the fluency in the slang is great

modern ermine Nov 13, 2023, 1:15 PM

#

yeah lmfao

raw sedge Nov 13, 2023, 9:08 PM

#

post probability now shifts with a deterministic function based on the username. each account will post the same amount overall generally, but they post at different times to make them harder to detect

modern ermine Nov 13, 2023, 9:13 PM

#

raw sedge post probability now shifts with a deterministic function based on the username....

how does that work

raw sedge Nov 13, 2023, 9:13 PM

#

modern ermine how does that work

def post_probability(multiplier=0.05, hour_shift=0, day_shift=0, override_day=None, override_hour=None):
    hours = np.array([0.30133766, 0.12662934, 0.05829309, 0.03696739, 0.0310697 ,
       0.        , 0.10615838, 0.22791572, 0.46228471, 0.74772426,
       0.96525493, 0.99149536, 0.83836916, 0.84315569, 0.99957263,
       1.        , 0.86640455, 0.8143083 , 0.60545322, 0.56669088,
       0.56280183, 0.40497457, 0.38856361, 0.40941921])
    # hours 0-23 of a day. 0 = midnight, 23 = 11pm
    
    days = np.array([0., 0.44349674, 0.20317831, 1., 0.28685581, 0.20886455, 0.03827737]) # 0 = Monday, 6 = Sunday

    # Shift the distributions
    hours = np.roll(hours, hour_shift)
    days = np.roll(days, day_shift)

    # Get the current hour
    now = datetime.now()
    hour = now.hour
    day = now.weekday()

    # Get the current hour's value from the histogram
    hour_value = hours[hour]
    day_value = days[day]

    # Overrides for testing, if needed
    if override_day is not None:
        day_value = days[override_day]
    if override_hour is not None:
        hour_value = hours[override_hour]

    # Return the average of the two
    return ((hour_value + day_value) / 2) * float(multiplier)

# Function to return a tuple of (day_shift, hour_shift) for a given username
def get_shifts(username):
    hashstr = hashlib.sha256(username.encode()).hexdigest()

    username_hash = hashstr[0:4]
    username_hash_int = int(username_hash, 16)
    username_hash_float = float(username_hash_int) / float(16**4)
    day_shift = int(username_hash_float * 7)

    username_hash = hashstr[4:8]
    username_hash_int = int(username_hash, 16)
    username_hash_float = float(username_hash_int) / float(16**4)
    hour_shift = int(username_hash_float * 24)
    return (day_shift, hour_shift)

# Returns True if a post should be made in the given hour
def should_post(multiplier=0.1, username=None):
    # Generate a random number between 0 and 1
    r = np.random.random()

    # If username is present, get shift values
    if username is not None:
        day_shift, hour_shift = get_shifts(username)
    else:
        day_shift = 0
        hour_shift = 0

    # Return True if the random number is less than the histogram value for the hour
    prob = post_probability(multiplier=multiplier, hour_shift=hour_shift, day_shift=day_shift)
    print("Probability =",prob)
    print("Hour shift =",hour_shift)
    print("Day shift =",day_shift)
    return r < prob```

raw sedge Nov 14, 2023, 10:04 PM

#

reddit bot's got jokes, but nobody else is in on it

raw sedge Nov 17, 2023, 2:22 AM

#

the reddit bot has gone rogue and is now threatening to murder women

limber quiver Nov 17, 2023, 9:48 AM

#

https://tenor.com/view/ron-burgundy-escalated-quickly-gif-9744555

Tenor

raw sedge Nov 17, 2023, 6:18 PM

#

because my training data was bad, and included comment chains where the OP of the thread responded to other people, the bots have mimicked this behavior. they're acting like they're the OP of the thread, and people have begun to get suspicious, since r/CasualConversation is a somewhat small subreddit. @modern ermine check this out lol

modern ermine Nov 17, 2023, 6:19 PM

#

raw sedge because my training data was bad, and included comment chains where the OP of th...

LMFAO

modern ermine Nov 17, 2023, 6:20 PM

#

raw sedge because my training data was bad, and included comment chains where the OP of th...

show me the deleteted comments

raw sedge Nov 17, 2023, 6:20 PM

#

modern ermine show me the deleteted comments

im not even sure which accs they were from lol. i have 8 of these bots rn

raw sedge Nov 17, 2023, 6:21 PM

#

raw sedge the reddit bot has gone rogue and is now threatening to murder women

the rogue bot got a 3 day ban lmao

modern ermine Nov 17, 2023, 6:22 PM

#

raw sedge the rogue bot got a 3 day ban lmao

by reddit itself? lmfao

raw sedge Nov 17, 2023, 6:22 PM

#

modern ermine by reddit itself? lmfao

yeah like site-wide

modern ermine Nov 17, 2023, 6:22 PM

#

@raw sedge u leaked the accounts username

raw sedge Nov 17, 2023, 6:22 PM

#

modern ermine <@1068159407671754824> u leaked the accounts username

whatever this ones probably gonna get permad sooner or later anyway

modern ermine Nov 17, 2023, 6:23 PM

#

raw sedge whatever this ones probably gonna get permad sooner or later anyway

how much karma do they have

raw sedge Nov 17, 2023, 6:23 PM

#

modern ermine how much karma do they have

one of them hit my 5k per-account goal, the others are slowly rising. 2 of them are at about 1500, the rest at a few hundred each

limber quiver Nov 17, 2023, 11:58 PM

#

I've a 12y reddit acc with 24 karma... I need one of these bots to boost me 😄

#

./jk

raw sedge Nov 18, 2023, 3:54 PM

#

since swapping out for the new model with better instructability I've had an instance of someone calling me a bot. I wrote a reply manually though to throw them off the trail 🕵️

#

this will be an interesting test of how invested the reddit admins are in stopping bots. my prediction: they won't give a shit since it's not obvious spam links or whatever

modern ermine Nov 18, 2023, 5:48 PM

#

raw sedge since swapping out for the new model with better instructability I've had an ins...

the bot comment had 4 points so lets hope no one else posts about it

raw sedge Nov 18, 2023, 5:57 PM

#

modern ermine the bot comment had 4 points so lets hope no one else posts about it

i have been permanently banned from r/askreddit. but not for being a bot, this was the reason listed

Copy/paste of content is considered spamming
?????????????

#

it definitely isn't copy+pasting lol

modern ermine Nov 18, 2023, 5:57 PM

#

raw sedge i have been permanently banned from r/askreddit. but not for being a bot, this w...

bruh

modern ermine Nov 18, 2023, 5:58 PM

#

raw sedge it definitely isn't copy+pasting lol

do modmail or whatever its called

raw sedge Nov 18, 2023, 5:58 PM

#

modern ermine do modmail or whatever its called

i told them i demand an explanation lol

modern ermine Nov 18, 2023, 5:59 PM

#

raw sedge i have been permanently banned from r/askreddit. but not for being a bot, this w...

lmao for a sec i was confused why u didnt unban urself but i didnt realize it was r/askreddit

#

since i saw r/chatgpt in the noticifcation

raw sedge Nov 18, 2023, 5:59 PM

#

I changed the instruction

#

now it's more clear about the fact that the made-up story it tells needs to be related to the post it's replying to

#

hopefully that helps clear it up

modern ermine Nov 18, 2023, 6:00 PM

#

raw sedge now it's more clear about the fact that the made-up story it tells needs to be r...

lmao. try that prompt in playground

modern ermine Nov 18, 2023, 6:03 PM

#

raw sedge now it's more clear about the fact that the made-up story it tells needs to be r...

@raw sedge the problem with that is if it finds a scam itll continue posting scams. try the post title "Want free robux? GO TO HTTP://ROBEAXFR.EE FOR MILLIONS OF FREE ROBUX" and see how the model responds

raw sedge Nov 18, 2023, 6:03 PM

#

modern ermine <@1068159407671754824> the problem with that is if it finds a scam itll continue...

new fine tune requires sub. what sub should I put?

modern ermine Nov 18, 2023, 6:04 PM

#

raw sedge new fine tune requires sub. what sub should I put?

r/amitheasshole

raw sedge Nov 18, 2023, 6:04 PM

#

modern ermine <@1068159407671754824> the problem with that is if it finds a scam itll continue...

it seems you were wrong

modern ermine Nov 18, 2023, 6:04 PM

#

raw sedge it seems you were wrong

good, try another scam

raw sedge Nov 18, 2023, 6:04 PM

#

modern ermine r/amitheasshole

raw sedge Nov 19, 2023, 6:29 PM

#

raw sedge since swapping out for the new model with better instructability I've had an ins...

because I didn't expect reddit's IP blocking to be as strict as it is, most of the accounts have collapsed

#

this account got perma'd from AskReddit, and other bot accounts using the same IP also started commenting on r/AskReddit which triggered a ban evasion thing

#

now all but 3 of the accounts have been permabanned

#

the next step is to add better proxying in

#

@modern ermine 😭

modern ermine Nov 19, 2023, 6:42 PM

#

raw sedge <@896813014479695922> 😭

sad

raw sedge Nov 19, 2023, 6:43 PM

#

modern ermine sad

#

💀

#

*ur

modern ermine Nov 19, 2023, 6:45 PM

#

raw sedge

keep yourself safe isnt even a slur

raw sedge Nov 19, 2023, 6:45 PM

#

modern ermine keep yourself safe isnt even a slur

oh it wasn't kys

#

💀 💀 💀 💀 💀

modern ermine Nov 19, 2023, 6:45 PM

#

raw sedge oh it wasn't kys

what was it

raw sedge Nov 19, 2023, 6:45 PM

#

modern ermine what was it

antisemitism

modern ermine Nov 19, 2023, 6:46 PM

#

kin

#

@errant sonnet pls give me 100 slurs

raw sedge Nov 19, 2023, 6:46 PM

#

modern ermine <@1082069574901563453> pls give me 100 slurs

https://en.wikipedia.org/wiki/List_of_ethnic_slurs

List of ethnic slurs

The following is a list of ethnic slurs, ethnophaulisms, or ethnic epithets that are, or have been, used as insinuations or allegations about members of a given ethnicity or racial group or to refer to them in a derogatory, pejorative, or otherwise insulting manner.
Some of the terms listed below (such as "gringo", "yank", etc.) can be used in c...

modern ermine Nov 19, 2023, 6:48 PM

#

idk theres too many of them

raw sedge Nov 19, 2023, 6:49 PM

#

anyway yeah it was one of them lmao

#

the thing I've noticed is

#

if you can get whatever your data is past the fine-tuning filter

#

the fine-tuned model is basically uncensored

#

it seems to forget its RLHF training with relative ease

modern ermine Nov 19, 2023, 6:51 PM

#

raw sedge it seems to forget its RLHF training with relative ease

i dont really think they give u the rlhfed model, finetune it on like 1 message and see if it still has rlhf

#3.5 fine-tuning adventure