#Evaluating Classifier-Free Guidance impact
4107 messages Β· Page 5 of 5 (latest)
yes, major win by the increase from 4->5. It was especially odd, since the 4 was actually the best review we had in terms of recognizing the novelty. I feel like a general comment to the ACs stressing the novelty of negative prompting might help
I feel that this time those reviewers are so responsive. At least they all replied something.
yeah... all the reviewers
I think it's still worth to write to the AC on R3, it still an absurd review and a good AC will simply ignore him.
what do you suggest?
i thought the content of the review was actually quite good
I think an overall comment to everyone addressing the novelty question would have the most impact on the ACs
Yes, so just writing to the AC that there is misaligmant between the content and the score, and there is no concrete reason why he provide this score
This is true, but since the reviewers that raise this concern provide very high score, I can't see an AC that reject the paper based on this claim
well R3 is the only one to actually say that we are novel, lol
Yes, exactly this is why I can't see a scenario where the paper is not accepted
Yeah this is a good point
How about this for an overall comment:
"Dear ACs:
Thank you so much for the time you've taken to read our paper, and the opportunity to present it to a wider audience at ICML.
We want to directly address that all reviewers commented on the novelty of our work, in light of earlier NLP papers. We strongly believe this is simply a framing deficit on our part, as evidenced by the high scores that the reviewers nevertheless gave us.
If given the opportunity in the camera-ready, we will emphasize four important areas of novelty:
- Framing contrastive prompting within the CFG framework allows us to extend earlier contrastive techniques gracefully to negative prompting, which had never been done before in autoregressive language models.
- Applying CFG to the chatbot/assistant setting, which involves large language models and instruction-tuned models. Previous papers tested versions of CFG in an earlier era of smaller models.
- The breadth of experiments that we ran across a wide, wide array of modern prompting techniques to demonstrate the continued viability of this technique.
- The computational cost analyses and interpretability analyses that we performed.
Please, if you will allow us this opportunity, we will be sure to clarify these points, which we believe makes this paper a valuable contribution to the ICML community.
"""
something like that
I honestly don't want to risk antagonizing the reviewers at all.... if one of them decides to revisit, they might lower their scores (that's happened to me, before)
I think that such comment to the AC is less recommended.
Addressing the AC directly should be only in extreme cases where you think that there is something inappropriate in the reviewing process.
The novelty criticism is legit, and for this you also wrote a good rebuttal that the AC suppose to read. And in any case the reviewers that raised it provide high score
R3 case is a classic AC comment, since this review is a garbage, although this case is so obvious that in any case if the AC decent and spent the 5 minutes to read the reviews he will simply ignore his score even without a comment
If everyone writes something to AC I bet they would be super overwhelmed given the crazy number of submissions. So I think it either needs to be short or like Elad said, pointing out something as a complaint
But indeed putting complaint on our best review feels weird...
Maybe we should just chill
i see... i'm thankful R3 raised their score and I'm not inclined to push back more against them
if one of you guys wants to write something, I'm happy to proofread it
woohoo!! acceptance at ICML!!
Congrats everyone! It took a while but we got it π
Thank you to everyone who participated π₯³π
Congratulations @versed flax 
We need an Impact statement for ICML.
What do we think of this?
Impact Statement
The development of large language models (LLMs) has significantly advanced natural language processing, enabling applications ranging from conversational agents to text generation. Our new technique introduces an innovative approach to LLMs by enhancing their ability to emphasize and adhere to given prompts. While this advancement presents exciting opportunities for improving user control and customization in LLM interactions, it also carries substantial risks.
This technique, when misused, has the potential to undermine the alignment mechanisms designed to ensure ethical and safe behavior in chatbots. By prioritizing prompt adherence over alignment protocols, it can facilitate the generation of harmful, biased, or toxic content, posing serious ethical concerns. Such capability could be exploited to bypass content moderation systems, leading to the dissemination of offensive material, misinformation, or other forms of digital harm.
It is crucial to address these risks by developing robust safeguards and ethical guidelines for the deployment of this technique. We emphasize the importance of continued research into alignment and safety measures to mitigate the negative impacts while harnessing the positive potential of enhanced prompt emphasis in LLMs.
Was it written by GPT-4o? Yes. Is it much better than anything I could have done myself? Also yes
Looks great tbh. Are we allowed to use llm to write this?
Well I have the general ideas so it's was used as a "sentence maker". My English isn't that good
me neither. Only Alex can write better than this π
@patent gull my dude, can you give your feedback about the impact statment?
sorry guys my oral proposal is tmrw I'll be able to look at it after that
ok :) good luck!
@patent gull please
btw this our final chance of renaming CFG to CFG-LLM or logits-CFG to disambiguate to both CFG-but-in-diffusion and Context-Free Grammar
What exactly is it you're thinking of renaming? The paper is titled CFG for Language models right
yes. We were cited by a paper who dubbed our approach as "CFG-LLM" and we thought it was better because it was disambiguating a bit
there's nothing more to it
Oh it's "Stay on topic with Classifier-Free Guidance"
I would be down with "Stay on topic with Classifier-Free Guidance for Language Models"
Oh, calling it "CFG-LLM" in the paper?
If you want to I'm fine with it. I don't think it's a big deal either way
Yes :)
CFG-LLM or LLM-CFG?
My professor friend also made the comment to add Large Language Models to the title somehow π€·ββοΈ
Yeah adding Large Language Models makes sense. When it comes to CFG-LLM it might feel like some sort of LLM rather than putting emphasis on CFG though.
nit: the noun is LLM so when we say "...applying CFG-LLM..." it's a little weird
well it's like "<Chain of Thought> <Prompting>" or "Diffusion Models"
the first word modifies the second
oh I mean how we refer to the method rather than the model
And the full phrase is "Classifier Free Guidance for Large Language Models"
so CFG-LLM is a natural abbreviation of that
"applying Classifier free guidance for Large Language Models"?
this one the noun is "Classifier Free Guidance"?
sorry maybe <adjective>-<noun> is too limiting, i'm not a linguist lol. <modifier>-<noun> is more general?
yeah I mean if it is understood that "-" is "for" then we are good. But I felt it's usually not the case
yeah I just mean the full name is <noun> <modifier> and the abbreviation seems like <modifier>-<noun>
CFG is the noun, not LLM IMO
i could see the argument for it
I prefer CFG-LLM but i'm ambivalent, it's up to @versed flax . However, i do dislike logits-CFG
but I guess we can still call our method CFG, and just refer to the paper as CFG-LLM
I genuinely don't see a need to change anything.
I think that in our paper it's very unambiguous that we're taking CFG and adapting it to LLMs
I also think that there's no problem with other people using slightly different wording when referring to our work. That's extremely common: names from papers often don't stick.
I could see an argument for someone who sees the citation not knowing it's an NLP paper based on the title alone
But replacing all instances of "CFG" with "CFG-LLM" in our paper will not improve readability or be the determining factor in whether a reader notices that we're studying LLMs
I agree
i agree with that, too!!
@versed flax here is the re-written impact statement:
While this advancement presents exciting opportunities for improving user control and customization in LLM interactions, it also carries substantial risks. We show that CFG can improve system-level prompt adherence in chatbot settings. However, the opposite is also true. CFG can also improve user-level prompt adherence, potentially undermining alignment mechanisms designed to ensure ethical and safe behavior in chatbots. It might be used to facilitate the generation of harmful, biased, or toxic content, posing serious ethical concerns. Such capability could be exploited to bypass content moderation systems, leading to the dissemination of offensive material, misinformation, or other forms of digital harm.
It is crucial to address these risks by developing robust safeguards and ethical guidelines for the deployment of this technique. We emphasize the importance of continued research into alignment and safety measures to mitigate the negative impacts while harnessing the positive potential of enhanced prompt emphasis in LLMs.```
second paragragh can be condensed to be more like you wrote it originally:
We are spotlight paper of ICML 2024!
Amazing achievement, congratulation everyone
I mean, why not just writing assistants?
wow, that's a deep thread link right there ^
@loud adder do you plan to tweet about CFG's spotlight award from the EAI account? if so, my handle is @AlexanderSpangh
Mine is @Void13950782 (sorry, it used to be a throwaway account of mine ages ago and I didn't care about the user id.....)
@eladlevico
Sure
@versed flax is @Vermeille_
i think that's cooler than a "real" sounding account, lol
Sorry for being totally stupid but uh, when do I have to be there? Conference Sessions only?
you choose when you want to go.
maybe you can sneak into workshops with the conference pass though
Ok so since I have to present the paper, only Conference is mandatory, right?
That is correct
thank you for clarifying
(It's not mandatory since I'll be there and it's only required that some author attend. But I highly recommend it)
Also, in case I forgot to say: if you don't have funding EleutherAI will pay for whatever it is you need to attend. So in this case two plane tickets, lodging Tuesday / Wednesday night, and the cost of the conference pass. If you want to extend you can do so, just make sure to keep all the receipts so it's easy to tell what we're paying for and what we aren't (namely the other nights and the add-ons to the ticket)
That's very nice but I already have a company taking care of it, don't worry
Oh man :/ Iβve been really curtailing my conference travel this summer due to lack of funding
But while I wonβt be at ICML, my twin brother will be! He looks exactly like me, so you all can meet him and pretend heβs me haha
@patent gull that's pretty funny. Definitely into us π or just tell him to come by the poster
Haha i swear that's not my way of saying "I'll be there but don't expect me to say hi."
There is actually an Alex Spangher and a Lucas Spangher, and we're identical, and we both have our PhDs in machine-learning related fields.
He'll be at ICML to share work on ML for Nuclear Fusion that he did during his post-doc at MIT, so be on the lookout for him as well!! πππ
Can confirm. I saw them both at the same time in the same room π
Nice!
Good luck on the presentation!
Good luck with the presentation today!
I hope you will have a great time
It went well thank you :)