Evaluating Classifier-Free Guidance impact | EleutherAI | Page 5

loud adder Apr 2, 2024, 11:52 AM

#

An average score of 6 does seem like it substantially increases our chances

patent gull Apr 2, 2024, 9:44 PM

#

yes, major win by the increase from 4->5. It was especially odd, since the 4 was actually the best review we had in terms of recognizing the novelty. I feel like a general comment to the ACs stressing the novelty of negative prompting might help

blissful garden Apr 3, 2024, 8:14 AM

#

I feel like the reviewers are really responsive this time. At least they all replied with something.

patent gull Apr 4, 2024, 12:35 AM

#

yeah... all the reviewers

fallow egret Apr 4, 2024, 3:37 AM

#

patent gull yes, major win by the increase from 4->5. It was especially odd, since the 4 was...

I think it's still worth writing to the AC about R3; it's still an absurd review, and a good AC will simply ignore him.

patent gull Apr 4, 2024, 3:38 AM

#

what do you suggest?

#

i thought the content of the review was actually quite good

#

I think an overall comment to everyone addressing the novelty question would have the most impact on the ACs

fallow egret Apr 4, 2024, 3:39 AM

#

Yes, so just writing to the AC that there is a misalignment between the content and the score, and that there is no concrete reason why he gave this score

fallow egret Apr 4, 2024, 3:40 AM

#

patent gull I think an overall comment to _everyone_ addressing the novelty question would h...

This is true, but since the reviewers who raised this concern gave very high scores, I can't see an AC rejecting the paper based on this claim

patent gull Apr 4, 2024, 3:40 AM

#

well R3 is the only one to actually say that we are novel, lol

fallow egret Apr 4, 2024, 3:42 AM

#

Yes, exactly, this is why I can't see a scenario where the paper is not accepted

blissful garden Apr 4, 2024, 3:44 AM

#

fallow egret Yes, so just writing to the AC that there is a misalignment between the content and...

Yeah this is a good point

patent gull Apr 4, 2024, 3:51 AM

#

How about this for an overall comment:

"Dear ACs:

Thank you so much for the time you've taken to read our paper, and the opportunity to present it to a wider audience at ICML.

We want to directly address that all reviewers commented on the novelty of our work, in light of earlier NLP papers. We strongly believe this is simply a framing deficit on our part, as evidenced by the high scores that the reviewers nevertheless gave us.

If given the opportunity in the camera-ready, we will emphasize four important areas of novelty:

1. Framing contrastive prompting within the CFG framework allows us to extend earlier contrastive techniques gracefully to negative prompting, which had never been done before in autoregressive language models.
2. Applying CFG to the chatbot/assistant setting, which involves large language models and instruction-tuned models. Previous papers tested versions of CFG in an earlier era of smaller models.
3. The breadth of experiments that we ran across a wide, wide array of modern prompting techniques to demonstrate the continued viability of this technique.
4. The computational cost analyses and interpretability analyses that we performed.

Please, if you will allow us this opportunity, we will be sure to clarify these points, which we believe make this paper a valuable contribution to the ICML community.
"

something like that

#

I honestly don't want to risk antagonizing the reviewers at all.... if one of them decides to revisit, they might lower their scores (that's happened to me, before)

fallow egret Apr 4, 2024, 4:12 AM

#

I think such a comment to the AC is not really recommended.
Addressing the AC directly should be reserved for extreme cases where you think something inappropriate happened in the reviewing process.
The novelty criticism is legit, and you also wrote a good rebuttal to it that the AC is supposed to read. And in any case, the reviewers who raised it gave high scores

#

R3's case is a classic case for an AC comment, since this review is garbage, although it is so obvious that if the AC is decent and spends the 5 minutes to read the reviews, he will simply ignore his score even without a comment

blissful garden Apr 4, 2024, 4:36 AM

#

If everyone writes something to the AC, I bet they would be super overwhelmed given the crazy number of submissions. So I think it either needs to be short or, like Elad said, point out something as a complaint

#

But indeed, filing a complaint about our best review feels weird...

#

Maybe we should just chill

patent gull Apr 4, 2024, 4:47 AM

#

i see... i'm thankful R3 raised their score and I'm not inclined to push back more against them

#

if one of you guys wants to write something, I'm happy to proofread it

patent gull May 2, 2024, 12:43 AM

#

woohoo!! acceptance at ICML!!

loud adder May 2, 2024, 2:31 AM

#

Congrats everyone! It took a while but we got it 🙂

versed flax May 2, 2024, 9:11 AM

#

Thank you to everyone who participated 🥳🎉

unique sedge May 2, 2024, 11:00 AM

#

Congratulations @versed flax blueFire

versed flax May 5, 2024, 5:33 PM

#

https://arxiv.org/pdf/2404.10179 new citation... by DeepMind!

versed flax May 18, 2024, 3:26 PM

#

We need an Impact statement for ICML.

What do we think of this?

Impact Statement

The development of large language models (LLMs) has significantly advanced natural language processing, enabling applications ranging from conversational agents to text generation. Our new technique introduces an innovative approach to LLMs by enhancing their ability to emphasize and adhere to given prompts. While this advancement presents exciting opportunities for improving user control and customization in LLM interactions, it also carries substantial risks.

This technique, when misused, has the potential to undermine the alignment mechanisms designed to ensure ethical and safe behavior in chatbots. By prioritizing prompt adherence over alignment protocols, it can facilitate the generation of harmful, biased, or toxic content, posing serious ethical concerns. Such capability could be exploited to bypass content moderation systems, leading to the dissemination of offensive material, misinformation, or other forms of digital harm.

It is crucial to address these risks by developing robust safeguards and ethical guidelines for the deployment of this technique. We emphasize the importance of continued research into alignment and safety measures to mitigate the negative impacts while harnessing the positive potential of enhanced prompt emphasis in LLMs.

versed flax May 18, 2024, 6:42 PM

#

Was it written by GPT-4o? Yes. Is it much better than anything I could have done myself? Also yes

blissful garden May 19, 2024, 9:59 AM

#

versed flax Was it written by GPT-4o? Yes. Is it much better than anything I could have done...

Looks great tbh. Are we allowed to use llm to write this?

versed flax May 19, 2024, 10:31 AM

#

blissful garden Looks great tbh. Are we allowed to use llm to write this?

Well I have the general ideas so it was used as a "sentence maker". My English isn't that good

blissful garden May 19, 2024, 10:37 AM

#

versed flax Well I have the general ideas so it was used as a "sentence maker". My English...

me neither. Only Alex can write better than this 😂

versed flax May 19, 2024, 8:44 PM

#

@patent gull my dude, can you give your feedback about the impact statement?

patent gull May 21, 2024, 9:59 PM

#

sorry guys my oral proposal is tmrw I'll be able to look at it after that

versed flax May 21, 2024, 9:59 PM

#

ok :) good luck!

versed flax May 25, 2024, 12:06 PM

#

@patent gull please

versed flax May 25, 2024, 1:21 PM

#

btw this is our final chance of renaming CFG to CFG-LLM or logits-CFG to disambiguate it from both CFG-but-in-diffusion and Context-Free Grammar

loud adder May 25, 2024, 2:53 PM

#

versed flax btw this is our final chance of renaming CFG to CFG-LLM or logits-CFG to disambigua...

What exactly is it you're thinking of renaming? The paper is titled CFG for Language models right

versed flax May 25, 2024, 2:55 PM

#

loud adder What exactly is it you're thinking of renaming? The paper is titled CFG for Lang...

yes. We were cited by a paper that dubbed our approach "CFG-LLM" and we thought it was better because it disambiguates a bit

#

there's nothing more to it

loud adder May 25, 2024, 3:35 PM

#

versed flax yes. We were cited by a paper that dubbed our approach "CFG-LLM" and we though...

Oh it's "Stay on topic with Classifier-Free Guidance"

#

I would be down with "Stay on topic with Classifier-Free Guidance for Language Models"

versed flax May 25, 2024, 3:36 PM

#

I'm just talking about the acronym just to be clear

#

we can edit the title too

loud adder May 25, 2024, 3:39 PM

#

Oh, calling it "CFG-LLM" in the paper?

#

If you want to I'm fine with it. I don't think it's a big deal either way

versed flax May 25, 2024, 4:04 PM

#

loud adder Oh, calling it "CFG-LLM" in the paper?

Yes :)

blissful garden May 25, 2024, 5:30 PM

#

versed flax Yes :)

CFG-LLM or LLM-CFG?

patent gull May 25, 2024, 11:50 PM

#

loud adder I would be down with "Stay on topic with Classifier-Free Guidance for Language M...

My professor friend also made the comment to add Large Language Models to the title somehow 🤷‍♂️

blissful garden May 26, 2024, 8:33 AM

#

Yeah adding Large Language Models makes sense. When it comes to CFG-LLM it might feel like some sort of LLM rather than putting emphasis on CFG though.

patent gull May 26, 2024, 5:18 PM

#

hmm i kinda like CFG-LLM more, idk. <adjective>-<noun>

#

i don't like logits-CFG lol

blissful garden May 26, 2024, 5:20 PM

#

patent gull hmm i kinda like CFG-LLM more, idk. <adjective>-<noun>

nit: the noun is LLM so when we say "...applying CFG-LLM..." it's a little weird

patent gull May 26, 2024, 5:21 PM

#

well it's like "<Chain of Thought> <Prompting>" or "Diffusion Models"

#

the first word modifies the second

blissful garden May 26, 2024, 5:22 PM

#

oh I mean how we refer to the method rather than the model

patent gull May 26, 2024, 5:22 PM

#

And the full phrase is "Classifier Free Guidance for Large Language Models"

#

so CFG-LLM is a natural abbreviation of that

patent gull May 26, 2024, 5:23 PM

#

blissful garden nit: the noun is LLM so when we say "...applying CFG-LLM..." it's a little weird

"applying Classifier free guidance for Large Language Models"?

blissful garden May 26, 2024, 5:23 PM

#

patent gull And the full phrase is "Classifier Free Guidance for Large Language Models"

in this one the noun is "Classifier Free Guidance"?

patent gull May 26, 2024, 5:24 PM

#

sorry maybe <adjective>-<noun> is too limiting, i'm not a linguist lol. <modifier>-<noun> is more general?

blissful garden May 26, 2024, 5:24 PM

#

yeah I mean if it is understood that "-" is "for" then we are good. But I felt it's usually not the case

blissful garden May 26, 2024, 5:25 PM

#

patent gull sorry maybe <adjective>-<noun> is too limiting, i'm not a linguist lol. <modifie...

yeah I just mean the full name is <noun> <modifier> and the abbreviation seems like <modifier>-<noun>

loud adder May 26, 2024, 5:25 PM

#

CFG is the noun, not LLM IMO

patent gull May 26, 2024, 5:27 PM

#

i could see the argument for it

#

I prefer CFG-LLM but i'm ambivalent, it's up to @versed flax . However, i do dislike logits-CFG

blissful garden May 26, 2024, 5:28 PM

#

but I guess we can still call our method CFG, and just refer to the paper as CFG-LLM

loud adder May 26, 2024, 5:30 PM

#

I genuinely don't see a need to change anything.

#

I think that in our paper it's very unambiguous that we're taking CFG and adapting it to LLMs

#

I also think that there's no problem with other people using slightly different wording when referring to our work. That's extremely common: names from papers often don't stick.

#

I could see an argument for someone who sees the citation not knowing it's an NLP paper based on the title alone

#

But replacing all instances of "CFG" with "CFG-LLM" in our paper will not improve readability or be the determining factor in whether a reader notices that we're studying LLMs

blissful garden May 26, 2024, 5:33 PM

#

I agree

patent gull May 26, 2024, 5:43 PM

#

i agree with that, too!!

@versed flax here is the re-written impact statement:


While this advancement presents exciting opportunities for improving user control and customization in LLM interactions, it also carries substantial risks. We show that CFG can improve system-level prompt adherence in chatbot settings. However, the opposite is also true. CFG can also improve user-level prompt adherence, potentially undermining alignment mechanisms designed to ensure ethical and safe behavior in chatbots. It might be used to facilitate the generation of harmful, biased, or toxic content, posing serious ethical concerns. Such capability could be exploited to bypass content moderation systems, leading to the dissemination of offensive material, misinformation, or other forms of digital harm.

It is crucial to address these risks by developing robust safeguards and ethical guidelines for the deployment of this technique. We emphasize the importance of continued research into alignment and safety measures to mitigate the negative impacts while harnessing the positive potential of enhanced prompt emphasis in LLMs.

#

second paragraph can be condensed to be more like you wrote it originally:

versed flax Jun 12, 2024, 4:53 PM

#

We are a spotlight paper at ICML 2024!

fallow egret Jun 12, 2024, 5:42 PM

#

versed flax We are a spotlight paper at ICML 2024!

Amazing achievement, congratulations everyone

spiral vapor Jun 13, 2024, 3:11 AM

#

I mean, why not just writing assistants?

patent gull Jun 13, 2024, 6:00 AM

#

wow, that's a deep thread link right there ^

#

@loud adder do you plan to tweet about CFG's spotlight award from the EAI account? if so, my handle is @AlexanderSpangh

blissful garden Jun 13, 2024, 8:13 AM

#

Mine is @Void13950782 (sorry, it used to be a throwaway account of mine ages ago and I didn't care about the user id.....)

fallow egret Jun 13, 2024, 8:41 AM

#

@eladlevico

loud adder Jun 13, 2024, 3:33 PM

#

Sure

patent gull Jun 13, 2024, 7:05 PM

#

@versed flax is @Vermeille_

patent gull Jun 13, 2024, 7:05 PM

#

blissful garden Mine is @Void13950782 (sorry, it used to be a throwaway account of mine ages ago...

i think that's cooler than a "real" sounding account, lol

versed flax Jul 5, 2024, 1:10 PM

#

Sorry for being totally stupid but uh, when do I have to be there? Conference Sessions only?

blissful garden Jul 5, 2024, 1:28 PM

#

versed flax Sorry for being totally stupid but uh, when do I have to be there? Conference Se...

you choose when you want to go.

#

~~maybe you can sneak into workshops with the conference pass though~~

versed flax Jul 5, 2024, 3:56 PM

#

blissful garden you choose when you want to go.

Ok so since I have to present the paper, only Conference is mandatory, right?

loud adder Jul 5, 2024, 4:08 PM

#

versed flax Ok so since I have to present the paper, only Conference is mandatory, right?

That is correct

versed flax Jul 5, 2024, 4:08 PM

#

thank you for clarifying

loud adder Jul 5, 2024, 4:09 PM

#

(It's not mandatory since I'll be there and it's only required that some author attend. But I highly recommend it)

#

Also, in case I forgot to say: if you don't have funding EleutherAI will pay for whatever it is you need to attend. So in this case two plane tickets, lodging Tuesday / Wednesday night, and the cost of the conference pass. If you want to extend you can do so, just make sure to keep all the receipts so it's easy to tell what we're paying for and what we aren't (namely the other nights and the add-ons to the ticket)

versed flax Jul 5, 2024, 4:12 PM

#

loud adder Also, in case I forgot to say: if you don't have funding EleutherAI will pay for...

That's very nice but I already have a company taking care of it, don't worry

patent gull Jul 7, 2024, 4:13 PM

#

Oh man :/ I’ve been really curtailing my conference travel this summer due to lack of funding

#

But while I won’t be at ICML, my twin brother will be! He looks exactly like me, so you all can meet him and pretend he’s me haha

loud adder Jul 7, 2024, 5:19 PM

#

@patent gull that's pretty funny. Definitely intro us 🙂 or just tell him to come by the poster

patent gull Jul 11, 2024, 8:18 PM

#

Haha i swear that's not my way of saying "I'll be there but don't expect me to say hi."

There is actually an Alex Spangher and a Lucas Spangher, and we're identical, and we both have our PhDs in machine-learning related fields.

He'll be at ICML to share work on ML for Nuclear Fusion that he did during his post-doc at MIT, so be on the lookout for him as well!! 👀👀👀

versed flax Jul 11, 2024, 8:48 PM

#

Can confirm. I saw them both at the same time in the same room 😂

versed flax Jul 16, 2024, 9:33 PM

#

the poster. Taking the feedback before sending it to print

📎 cfg-poster.pdf

#

Stay_On_Topic_with_Classifier-Free_Guidance_for_LLMs.jpg

blissful garden Jul 17, 2024, 6:10 AM

#

versed flax

Nice!

unique sedge Jul 17, 2024, 9:04 AM

#

versed flax

Good luck on the presentation!

fallow egret Jul 24, 2024, 10:41 AM

#

Good luck with the presentation today!
I hope you will have a great time

versed flax Jul 24, 2024, 11:10 AM

#

It went well thank you :)
