#bengali-ai-speech

low helm
#

Hi everyone! @here

This is Sushmit, one of the hosts of this competition. Really nice to see so many people working on this challenge. My teammates from Bengali.AI and I are available here and in the Kaggle discussion threads to answer any of your questions or concerns. Feel free to reach out!

Some pointers:
Bengali orthography is complex due to the use of diacritics and connectors. We hosted another Kaggle competition in 2019 on a dataset based on this issue. Ref: https://arxiv.org/ftp/arxiv/papers/2010/2010.00170.pdf, https://www.kaggle.com/c/bengaliai-cv19
There's also the issue of Unicode ambiguities in Bengali (there are multiple ways of writing the same thing). We provide a Python module to deal with this: https://arxiv.org/pdf/2306.01743.pdf
Insights regarding some out-of-distribution domains: https://arxiv.org/ftp/arxiv/papers/2305/2305.09688.pdf
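
To make the Unicode ambiguity point concrete, here is a minimal sketch using only Python's standard `unicodedata` module. It is not the module from the paper linked above, and NFC normalization only covers canonical equivalences, so treat it as an illustration of the problem rather than a full fix. The example syllable and the `canonical` helper name are chosen here for illustration.

```python
import unicodedata

# Two encodings of the same visible syllable:
# (a) KA + the single vowel sign O (U+09CB)
composed = "\u0995\u09cb"
# (b) KA + vowel sign E (U+09C7) + vowel sign AA (U+09BE)
decomposed = "\u0995\u09c7\u09be"


def canonical(text: str) -> str:
    """Return the NFC form so canonically equivalent sequences compare equal."""
    return unicodedata.normalize("NFC", text)


# The raw strings differ even though they render identically.
print(composed == decomposed)
# After normalization they compare equal.
print(canonical(composed) == canonical(decomposed))
```

Normalizing both the training transcripts and the model output the same way keeps identical-looking words from being counted as errors.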

willow loom
#

Thanks for reaching out and for the pointers, @low helm!

winged swallow
#

Hello

keen kestrel
austere spear
#

Hey! I'm working on the Bengali competition. Is there a way to figure out which words a particular model isn't able to classify well?
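
One possible approach, sketched under assumptions: align each reference transcript with the model's output word by word and count which reference words get substituted or dropped. The helper names `word_errors` and `worst_words` are made up here, and `difflib` gives an approximate alignment rather than a true minimum edit distance, so the counts are a rough guide only.

```python
from collections import Counter
from difflib import SequenceMatcher


def word_errors(reference: str, hypothesis: str) -> Counter:
    """Count reference words that were substituted or deleted in the hypothesis."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    errors = Counter()
    matcher = SequenceMatcher(a=ref_words, b=hyp_words, autojunk=False)
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        # "replace" and "delete" both mean the reference word was not recovered.
        if tag in ("replace", "delete"):
            errors.update(ref_words[i1:i2])
    return errors


def worst_words(pairs, top_n=10):
    """Aggregate errors over (reference, hypothesis) pairs, most frequent first."""
    total = Counter()
    for reference, hypothesis in pairs:
        total += word_errors(reference, hypothesis)
    return total.most_common(top_n)
```

Calling `worst_words` on a validation set's (reference, prediction) pairs returns the words the model misses most often.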

rugged dirge
#

Hi, I'm trying to train a wav2vec model for a contest, but after 800 steps the loss becomes NaN and the WER goes to 1. Does anyone know why this is happening?

silent granite
#

It might be that your LR (learning rate) is too high.
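
If the learning rate is the cause, two common things to try are a lower peak value and a warmup phase, plus a guard that flags non-finite losses early. A minimal framework-independent sketch follows; the function names and any numbers passed to them are hypothetical, not settings from this thread.

```python
import math


def warmup_lr(step: int, peak_lr: float, warmup_steps: int) -> float:
    """Linearly ramp the learning rate up to peak_lr, then hold it constant."""
    if step < warmup_steps:
        return peak_lr * (step + 1) / warmup_steps
    return peak_lr


def loss_is_usable(loss: float) -> bool:
    """Return False for NaN or infinite losses so the step can be skipped or logged."""
    return math.isfinite(loss)
```

Logging the first step where `loss_is_usable` returns False makes it easier to see whether the blow-up coincides with the learning rate reaching its peak.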

dull tulip
rugged dirge
floral hamlet
#

Hi, I'm new to speech recognition. I have been trying an ASR transformer model. I tried different audio preprocessing steps (normalization, silence trimming), but I could not feed in more than 100k samples with max_text_length = 30, because it crashes in Colab due to RAM capacity (I tried batch_size = 4). What other preprocessing can I do before feeding in the data? Another thing is converting the mp3 files to wav (I also used parallel processing, and it took 9 hours). Thanks.
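
One way to stay within RAM, assuming the crash comes from holding all decoded audio in memory at once, is to keep only file paths in memory and load audio lazily, one batch at a time. A minimal sketch with a generator; `iter_batches` and the `load` callback are hypothetical names, and `load` stands in for whatever decoding function is actually used.

```python
from typing import Callable, Iterable, Iterator, List, TypeVar

T = TypeVar("T")
U = TypeVar("U")


def iter_batches(
    items: Iterable[T], load: Callable[[T], U], batch_size: int
) -> Iterator[List[U]]:
    """Yield lists of loaded items; only one batch is held in memory at a time."""
    batch: List[U] = []
    for item in items:
        batch.append(load(item))  # decode lazily, only when the item is needed
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        # Emit the final, possibly smaller, batch.
        yield batch
```

With this pattern the dataset size no longer bounds memory use; only `batch_size` and the length of the longest clip do.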

candid jetty
#

Hi, I am completely new to ML. Would anyone care to carry me around?

smoky garden
#

Hi everyone, each time I make a submission I see a submission error saying submission.csv was not found. How can I tackle this?
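
One common cause of this error is that the notebook never writes a file named exactly `submission.csv` to its working directory, or crashes before it gets there. A minimal sketch of writing the file follows; the column names `id` and `sentence` are an assumption and should be checked against the competition's sample submission, and `write_submission` is a made-up helper name.

```python
import csv
from pathlib import Path


def write_submission(predictions: dict, out_dir: str = ".") -> Path:
    """Write {id: transcript} pairs to submission.csv in out_dir and return its path."""
    path = Path(out_dir) / "submission.csv"
    with path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["id", "sentence"])  # assumed header; verify before use
        for sample_id, sentence in predictions.items():
            writer.writerow([sample_id, sentence])
    return path
```

Calling this as the last step of the notebook, and checking that the returned path exists, helps separate a missing file from an earlier failure.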

dusk gust
tepid rover
#

Hey guys, I am a notebooks master and I am looking to improve in competitions. I am currently 54th place in the competition and looking for someone to work together with to do better. Let me know if interested.

gritty grove
tepid rover
#

@gritty grove For sure

gusty condor
#

Hello guys, I am looking for a teammate for this competition.

charred river
#

Hello!
I have joined the Bengali.AI competition. I am running on CPU and my notebook has been running for more than 5 hours now. Is this normal?
If not, what should I do? This is my first competition.