#gemma-language-tuning
1 messages · Page 1 of 1 (latest)
Very cool competition!
I wanted to ask whether we could fine-tune a language that's not there in the list.
Hey anyone wants to form a team with me?
I'm a beginner if you don't mind
Hi. This is my first Kaggle competition I'm enrolling in but I've been really fascinated with translation especially since my Filipino/Tagalog is conversational at best and translation tools have been lacking for Filipino. I've dabbled in fine tuning before but this contest brings a new interest for me.
I can do it this contest by myself, but I can definitely see where working on a team would also be useful. I'm building the dataset right now and fine tuning in parallel. I've got some fine tunes already and they're showing promising results. Still working on more data to improve. Let me know if you're interested to teaming up and what you can bring to the table.
@amber knot so sorry for the ping, but is this invalid?
it's probably best to ask in the competition forum so that the host is able to answer directly (unless someone else here knows)
Thank you Meg. I'll try that.
i see 2b and 9b models in the models tab at https://www.kaggle.com/competitions/gemma-language-tuning/models
should we use just 2b/9b or can we use 27b as well ?
- we have 2 spots on the team , will be doing it over telugu/hindi depending on other 2 , feel free to reach out
Hi, I'm a 16 year old who's been doing computer vision and llm projects/competitions for a bit now and am just getting started with finetuning Gemma for Mandarin and Chinese culture. I would love to work along with others for this competition and learn from how different people approach ml competitions. If anyone's interested in teaming up please let me know!
Also, I'm fine with switching to a different preferred language, but mandarin would be my top choice.
Hi boys, I'm working with a Friend in a English to Spanish finetune, let anyone want to join in?
I'm from Vietnam and working on English <=> Vietnamese finetune, wanna join? Things we are doing https://huggingface.co/Symato
will gemma fine tuned for math can be submitted?
Which of the Gemma models we need to use,2B, 9B or 27 B?
If someone helps it would be greate
whichever is more applicable to the usecase you wish to demonstrate.
i too ve this question
I did not received any answer for this
Got you, so theres no issue with using the higher models then?
I was also thinking along the line of the language not on the list. My native language Chibemba . If I can get some one interested.
I asked regarding this in the Kaggle discussions section of the Competition, but got no answer, even I'm working on a language that hasn't been fine-tuned before.
how to authenticate if using collab i already accepted consent at kaggle gemma model page
Thanks for the heads up. I hope it is allowed!!
I think any language is important and the method can work across languages so if people don't interested in your lang - just because they don't know it yep - they may and will interest in your methods
Language specific problems also interesting, how a tokenization method affect languages differently is one example
I'm quite curious since the most popular BPE is made for English and alike langs
Hello, i'm new here, i need to join one team
@warped snow refer this notebook for authentication by gemma, pinned in competition code
Its working fine on kaggle notebook but authentication issues when try to use in collab notebook
Then you refer any random yt video
I was wondering about some guidance to compete because this is my first time in such a competition
Hi everyone ,
I’m currently working with the Gemma Instruct 2B EN model from keras_nlp and have been fine-tuning it using LoRA. However, I’m curious about other fine-tuning techniques that could be effective with this model. If anyone has experience with raw, hands-on code or can point me to resources or repositories where alternative methods (like full fine-tuning, adapter layers, etc.) are used with this model, it would be really helpful.
Any code examples or practical insights are greatly appreciated. Thanks in advance for your help!
some methods I know:
- full finetune (2b can fit consumer gpus, 24g vram for example)
- block expanse https://arxiv.org/abs/2401.02415, only fintune some newly added layers, frozen the rest
- mix of them: some frozen, some use lora, some use full-finetune ...
- https://huggingface.co/docs/peft/developer_guides/lora#memory-efficient-layer-replication-with-lora is nice too ...
Thank you.. I really appreciate your help.
Hi everyone, is there anyone working on some languages such as Urdu, Sindhi, Balochi in NLP domain I want to connect let me know in dm
hi, is different arabic dialects allowed in this contest or just modern standard arabic?
he guys
hey 👋🏻 anyone out there looking for a teammate for this competition !
Hello, is it possible to use Gemma models to create a sequence to sequence model?
I fine-tuned gemma on brain rot language for fun - https://www.kaggle.com/code/shreeshabhat1004/finetuning-gemma-on-brain-rot-language-for-fun
Love the idea, but it sorta answers everything with ohio sigma rizz vibes? (Tbf, i do as well, but is that the behavior u want?)
it's actually because I finetuned it on a very small dataset and The dataset happens to be containing whole lot of Ohio sigma rizz... Or maybe if I change the output template it's gonna give out more than Ohio skibidi rizz
yeah :)) consider filtering and using the 4chan or reddit dataset, im sure youll find the brainrottest texts :()
sure :😁
lol
Hello guys, I wanted to ask if sanskrit is one of the allowed language or not, I see a lot of entries that finetune on sanskrit but it was not on the list
Hi just wanna ask how much gpu vram does it take to finetune gemma 2? Just so i can gauge resources needed
Im seeing 16GB for 2B params??!
I am trying to do the same thing but getting resource ran out error
are we allowed to use any other methods ALONG with finetuning ? Like RAGs or pretraining
Pretraining im not sure, that would be quite insane. But okay on the RAG
But ya know, if u pretrain on a pretrained model, you're finetuning, so who cares lmao
Hi Do we need to use Gemma from models here https://www.kaggle.com/models/google/gemma-2/PyTorch/gemma-2-9b-pt or can we also use it with Unsloth Gemma model
Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.
Same question !!!!
Guys like how long did it take you to finish fine-tuning your model?
mine took around 6 hours, the dataset was around 6 mb
Ah I see thanks man
Anyone tried multi dialect or multi lingual fine tuning ?
Did anyone tried fine tuning on books ?
Hi anyone knows how to unload and reload a lora in Keras?
your_model.backbone.enable_lora(rank=lora_rank) enables lora
did you mean you want to load the lora weights into the model?
your_model.backbone.load_lora_weights(filepath) should do it
is it strictly necessary to use pretrained gemma 2b model ? can we use opensource already fine tuned models(which are not participants in the comp) for further fine tuning for the comp ??
No i meant i already load_lora_weights once. How do i remove the lora weights, then apply a new one?
All the best everyone, I'm not really participating in this competition but I'm looking forward to everything you folks have built.
its last day today, all the best to all folks here
https://www.kaggle.com/code/shreeshabhat1004/non-binary-inclusive-gemma-finetuning Our submission focuses on inclusive communication with non binary people. We aim to constantly improve it, even after the competition is over. Suggestions or criticisms are most welcome. Thank you!
When will winners announce
It will probably take few weeks for them to assess the notebooks and announce winners
I have a query
Before the deadline, I made my notebook (https://www.kaggle.com/code/ayeshaimr/gemma-2-urdu-adaptation-a-health-centric-approach) public as well as the datasets used for finetuning and the finetuned model was also published on kaggle models. However on the competition page, in the code and models section, i don't see my notebook and model present (in "your work") because I didn't specifically link them to the competition (I didn't know how to do that or the fact that this was necessary). Is my submission still valid? All other conditions are fulfilled. Just hella worried now because my teammates and I spent a lot of effort and time on it and don't want it invalidated due to this simple reason.
Also if anyone knows how I can attach my submitted notebook (without changes) to the competition now, please let me know.
@modern cove
You should have added the competition dataset to link the notebook
For the model, I don't think that is necessary
https://www.kaggle.com/code/williamalabi/lingogemrag My submission focuses on the combined use of LoRA, ReFT and RAG for efficient language model adaptation
There was no dataset though?
I’m just wondering if my submission is still valid :((
I worked on Swahili variations of Gemma2, we released a couple of models, I would really apreciate you checking this out and hear your feedback 🙂
https://www.kaggle.com/code/alfaxadeyembe/introducing-gemma-2-swahili
Yes you still have to add it
Search for the competition name in the datasets section and add it
thank you! I’ll try that
your submission must be valid because they didnt mention explictly in description of competition, to create a notebook in the competition page itself
Yeah I hope that is indeed the case!
When will the winner be announced?
It's there.
To participate in this competition, you must create and share a public Kaggle Notebook that demonstrates how to use the Gemma model for various languages and/or cultural contexts AND publish your variant to Kaggle models. Your Kaggle Notebook must be made public (along with any underlying data sources) and it should be attached to the official competition dataset.
How do you expect them to see your submission? They won't go to your profile to check. Except maybe if you made a discussion post about it
Oh there was a google form to be submitted, you had to include notebook url and model url in it
Yeah true
did any of you try tuning 9b or 27b models? Just curious
I did but not on Kaggle
thats nice