can anyone help me with python machine learning? | Smarter Dev | Page 1

wintry lagoon Nov 27, 2024, 6:41 PM

#

Explain problem here bro

#

!rule dm

astral ivyBOT Nov 27, 2024, 6:41 PM

#

Direct Messages

Please keep discussion on the server. It helps you get responses more quickly from more people & with more viewpoints. It also helps protect you from scammers.

We recommend disabling DM's from all public servers.

urban kindle Nov 27, 2024, 6:42 PM

#

i have to make a program that get a sound of person talking and it needs with background vs foreground to find the seconds the person talks
and it wants me to create an ML from the begining no ready dataframes.

jaunty hinge Nov 27, 2024, 6:42 PM

#

What is "it"?

wintry lagoon Nov 27, 2024, 6:42 PM

#

This sounds familiar?

#

Oh it's cause you asked this then left

urban kindle Nov 27, 2024, 6:44 PM

#

i really need help.

jaunty hinge Nov 27, 2024, 6:44 PM

#

jaunty hinge What is "it"?

.

urban kindle Nov 27, 2024, 6:44 PM

#

urban kindle i have to make a program that get a sound of person talking and it needs with ba...

.

urban kindle Nov 27, 2024, 6:44 PM

#

jaunty hinge .

i have to make a program that get a sound of person talking and it needs with background vs foreground to find the seconds the person talks
and it wants me to create an ML from the begining no ready dataframes.

jaunty hinge Nov 27, 2024, 6:44 PM

#

I can read

#

"it wants me to [...]" -> who or what is "it"

urban kindle Nov 27, 2024, 6:45 PM

#

urban kindle i have to make a program that get a sound of person talking and it needs with ba...

i mean to do this job.

#

recognise the seconds a person talks

jaunty hinge Nov 27, 2024, 6:46 PM

#

just use whisperx lol

#

https://github.com/m-bain/whisperX throw that on a semi-decent GPU and be happy

GitHub

GitHub - m-bain/whisperX: WhisperX: Automatic Speech Recognition w...

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) - m-bain/whisperX

#

gives you the recognized texts with the timestamps for each sentence

urban kindle Nov 27, 2024, 6:46 PM

#

i cant use ready i need to make my own and train it.

jaunty hinge Nov 27, 2024, 6:46 PM

#

So presumably this is some sort of college assignment

#

and they presumably also gave you a dataset to work with?

urban kindle Nov 27, 2024, 6:47 PM

#

i need to find a dataset

jaunty hinge Nov 27, 2024, 6:47 PM

#

So you don't need to make your own model?

urban kindle Nov 27, 2024, 6:47 PM

#

and use it to train the

urban kindle Nov 27, 2024, 6:47 PM

#

jaunty hinge So you don't need to make your own model?

I need to make my model

jaunty hinge Nov 27, 2024, 6:48 PM

#

https://www.openslr.org/12

openslr.org

Open Speech and Language Resources.

urban kindle Nov 27, 2024, 6:48 PM

#

i never use Ml and i need some help.

#

the problem is to make the Ml

jaunty hinge Nov 27, 2024, 6:48 PM

#

urban kindle i never use Ml and i need some help.

What did you do in the earlier parts of the semester? 😂

urban kindle Nov 27, 2024, 6:49 PM

#

the lesson is about sound not about Ml

#

the teacher just puts hard exercise

jaunty hinge Nov 27, 2024, 6:49 PM

#

Sounds absolutely insane to have students do an ML model from scratch in some kind of audio class but whatever

urban kindle Nov 27, 2024, 6:49 PM

#

i know

jaunty hinge Nov 27, 2024, 6:50 PM

#

You just need to identify the timestamps?

#

What sounds more likely is that they want you to use some sort of concept you learned in the class (e.g. audio processing), seems highly unlikely that they expect you to just know ML 🤨

urban kindle Nov 27, 2024, 6:52 PM

#

yeah like the person says the word "table" and i need to say he is saying it in 2,5 to 3 second of the sound.

jaunty hinge Nov 27, 2024, 6:54 PM

#

You need to say "the person is speaking from 2.5 to 3 seconds" or do you also need the text content?

urban kindle Nov 27, 2024, 6:55 PM

#

both

jaunty hinge Nov 27, 2024, 6:55 PM

#

You're not training that from scratch

#

hell nah

urban kindle Nov 27, 2024, 6:55 PM

#

what can i use.

#

?

jaunty hinge Nov 27, 2024, 6:56 PM

#

I already recommended whisprx which does exactly what you need

urban kindle Nov 27, 2024, 6:57 PM

#

You are tasked with implementing a system that segments a sentence into words, mandatorily using a background vs foreground classifier of your choice. Given a recording of a speaker, the system should return the time boundaries of the spoken words (in seconds). Additionally, you must provide an accompanying program that plays back the detected words. The number of words in the sentence is not known in advance, but you can assume that there is a small gap of silence between the words.

jaunty hinge Nov 27, 2024, 6:58 PM

#

This is a completely different problem than the one you outlined lol

urban kindle Nov 27, 2024, 6:58 PM

#

Attention!!!: You cannot use convolutional neural networks. The use of ready-made web services or APIs for speech recognition is not allowed. Transfer learning from pre-trained networks is also not allowed. Solutions that violate these rules will receive zero points.

urban kindle Nov 27, 2024, 6:59 PM

#

jaunty hinge This is a completely different problem than the one you outlined lol

can we vc so you can help me a bit?

#

and explain to me?

jaunty hinge Nov 27, 2024, 6:59 PM

#

No

jaunty hinge Nov 27, 2024, 7:00 PM

#

urban kindle You are tasked with implementing a system that segments a sentence into words, m...

See you don't actually need to do transcription

#

they tell you there are no background noises and that you have a pause between words

#

So you just yoink something like http://dx.doi.org/10.1145/2814895.2814926 and are done

urban kindle Nov 27, 2024, 7:01 PM

#

urban kindle Attention!!!: You cannot use convolutional neural networks. The use of ready-mad...

is this official python library?

jaunty hinge Nov 27, 2024, 7:03 PM

#

What

urban kindle Nov 27, 2024, 7:03 PM

#

wait a bit

urban kindle Nov 27, 2024, 7:05 PM

#

jaunty hinge What

A) You are tasked with implementing a system that segments a sentence into words, mandatorily using a background vs foreground classifier of your choice. Given a recording of a speaker, the system should return the time boundaries of the spoken words (in seconds). Additionally, you must provide an accompanying program that plays back the detected words. The number of words in the sentence is not known in advance, but you may assume that there is a small silence interval between words.

You must implement and compare the performance of the following classifiers: Least Squares, SVM, RNN, and a 3-layer MLP (specify the number of neurons per layer). The comparison should be conducted as is typical for binary classification systems.

B) From the detected words, calculate the speaker's average fundamental frequency.

You must explain which data were used during the testing and training of the system. If they are your own, explain how you created them; if they are open source, explain how they were utilized.
Try to ensure that the system is speaker-independent and as robust as possible to variations in speaker characteristics.

jaunty hinge Nov 27, 2024, 7:05 PM

#

See there you have it

#

they want you to implement a very basic thing and they tell you what to implement

urban kindle Nov 27, 2024, 7:06 PM

#

can you give me a explanation of what i need to do?

jaunty hinge Nov 27, 2024, 7:07 PM

#

https://www.youtube.com/watch?v=WAxfTAy6RS8 there you go here's a nice professor from some Indian university I think?

YouTube

IIT Madras - B.S. Degree Programme

Least Square Classification

least square error, Optimization via normal equation and gradient descent, inference

▶ Play video

urban kindle Nov 27, 2024, 7:08 PM

#

jaunty hinge https://www.youtube.com/watch?v=WAxfTAy6RS8 there you go here's a nice professor...

i really dont understand what i need to do

#

can you give me a general explanatin

jaunty hinge Nov 27, 2024, 7:08 PM

#

What did you do the entire semester 💀

jaunty hinge Nov 27, 2024, 7:08 PM

#

urban kindle can you give me a general explanatin

yea you gotta write code

urban kindle Nov 27, 2024, 7:08 PM

#

the lesson is about sound

jaunty hinge Nov 27, 2024, 7:09 PM

#

You probably ignored some of the requirements then? 💀

#

also never heard of an audio engineering class in a computer science degree

urban kindle Nov 27, 2024, 7:10 PM

#

i really cant understand what i need to do.

jaunty hinge Nov 27, 2024, 7:10 PM

#

Have you tried asking your professor for some help?

#

Cuz either you're lying or they're insanely incompetent lol

wintry lagoon Nov 27, 2024, 7:12 PM

#

💀

#

Get his ass @jaunty hinge

urban kindle Nov 27, 2024, 7:14 PM

#

now i understand what i need to do thanky you

#can anyone help me with python machine learning?