#data-science-and-ml | Python | Page 168

verbal oar Jun 8, 2025, 4:52 PM

#

hmm so why not just use maple or octave?

#

about symbolic computation

pale merlin Jun 8, 2025, 5:09 PM

#

Here is our Python library for visualizing categorical Sliding Window Data, KitikiPlot
If you are interested,
feel free to explore GitHub: https://github.com/BodduSriPavan-111/kitikiplot

GitHub

GitHub - BodduSriPavan-111/kitikiplot: KitikiPlot A Python library ...

KitikiPlot A Python library to visualize categorical sliding window data - BodduSriPavan-111/kitikiplot

serene scaffold Jun 8, 2025, 5:49 PM

#

pale merlin Here is our Python library for visualizing categorical **Sliding Window Data**, ...

maybe you can present this at our project showcase next week https://discord.com/events/267624335836053506/1378400483231666196

weary timber Jun 8, 2025, 9:01 PM

#

https://www.reddit.com/r/MachineLearning/s/b4YAk8uejX

From the MachineLearning community on Reddit: [D] help with fixing ...

Explore this post and more from the MachineLearning community

#

can someone help me with this please 😔

lapis sequoia Jun 9, 2025, 1:07 AM

#

t5 can be fine-tuned on any dataset, right? It does not have to be massive, right?

pale merlin Jun 9, 2025, 4:02 AM

#

serene scaffold maybe you can present this at our project showcase next week https://discord.com...

Yeah

woven prairie Jun 9, 2025, 6:28 AM

#

I have two questions regarding image generation and image manipulation with dalle model

Is it possible to generate realistic images using dalle , I am trying a lot but results are not that good, as compared to mid journey or stable diffusion, I am enhancing the prompt using gpt4o and then passing the enhanced prompt to dalle but still results are that good.
Can we image inpaint in dalle model.

#

#

Here is a image I want to inpaint the white space, without resizing the image , just the white space needs to be filled with the image colour.

jaunty helm Jun 9, 2025, 7:34 AM

#

woven prairie I have two questions regarding image generation and image manipulation with dall...

I don't have experience with dalle
and what you're trying to do is outpainting rather than inpainting, so searching for that may be more helpful

woven prairie Jun 9, 2025, 8:53 AM

#

Ok but I have tried it's not happening from my side

tawdry dove Jun 9, 2025, 10:40 AM

#

Hello,

What's a good way to perform call diarization on a phone call to get 2 speakers on a mono audio ?

#

Imagine a call is between a client and an agent , what's the best way to diarize that call and get the client and agent audio. The client audio would then be embedded and stored and will be pulled to verify the client next time he calls

#

Please ping me when you reply to me , thanks

lapis sequoia Jun 9, 2025, 12:17 PM

#

does the dataset matter for T5 summarization in terms of size?

small fiber Jun 9, 2025, 2:10 PM

#

Val - I sent an overview of our products to Shouki and Jeremy. I will let them describe their requirements first - this is more of a consulting gig - I doubt we will need to do any demos

serene grail Jun 9, 2025, 2:50 PM

#

small fiber Val - I sent an overview of our products to Shouki and Jeremy. I will let them ...

Not sure what this means, you might have accidentally sent this in the wrong channel?

pale merlin Jun 9, 2025, 3:21 PM

#

serene scaffold maybe you can present this at our project showcase next week https://discord.com...

@serene scaffold Can I do it for the next event? (some family commitments are scheduled)

serene scaffold Jun 9, 2025, 3:22 PM

#

pale merlin <@253696366952316929> Can I do it for the next event? (some family commitments a...

you can submit your project next time. go to #roles to make sure you're subscribed to announcements.

drifting loom Jun 9, 2025, 5:58 PM

#

hi , can anyone help solve my assignment ? (data science ) plz dm if anyone can it will be pleasure

serene scaffold Jun 9, 2025, 6:13 PM

#

drifting loom hi , can anyone help solve my assignment ? (data science ) plz dm if anyone can ...

put your whole question in this chat--it slows everyone down when you ask to ask.

drifting loom Jun 9, 2025, 6:27 PM

#

serene scaffold put your whole question in this chat--it slows everyone down when you ask to ask...

why can't i share .zip file?

serene scaffold Jun 9, 2025, 6:27 PM

#

drifting loom why can't i share .zip file?

because it's laborious to download a zip file, unzip it, and inspect the content.
If you need to share code, use our paste bin

#

!paste

arctic wedgeBOT Jun 9, 2025, 6:27 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

drifting loom Jun 9, 2025, 6:28 PM

#

ya thanks

serene scaffold Jun 9, 2025, 6:28 PM

#

note the "add another file" button. you can put all the files in the same paste.

#

also "drop files or click here to upload files"

drifting loom Jun 9, 2025, 6:29 PM

#

its .docx

#

how do i share that

serene scaffold Jun 9, 2025, 6:29 PM

#

drifting loom its .docx

then copy/paste the text into the paste bin.

drifting loom Jun 9, 2025, 6:30 PM

#

ohk

#

.

📎 message.txt

arctic wedgeBOT Jun 9, 2025, 6:30 PM

#

drifting loom .

~~Please react with ✅ to upload your file(s) to our paste bin, which is more accessible for some users.~~

drifting loom Jun 9, 2025, 6:30 PM

#

there u go

serene scaffold Jun 9, 2025, 6:30 PM

#

^ Please do it.

#

@drifting loom Please always follow instructions from the bot. The instructions are written by humans.

drifting loom Jun 9, 2025, 6:31 PM

#

serene scaffold <@505770350706163712> Please always follow instructions from the bot. The instru...

ya sure

serene scaffold Jun 9, 2025, 6:34 PM

#

drifting loom ya sure

You did not follow the instruction from the bot. You only had to push a button.

What have you done so far to try to complete this assignment, and what specific problem did you run into?

nocturne wind Jun 9, 2025, 6:36 PM

#

# Hi everyone
# I'm basically praciting w common libraries like seaborn pandas numpy etc. However, I get an error about the question below.

# sample question: Fill in the missing values of the "deck" variable in the titanic dataset with 'Unknown'. Then draw a graph showing the frequencies of the "deck" variable.
 
titanic["deck"].fillna("Unknown", inplace=True)

# How am i supposed to do it? I've imported all needed libraries. 
# The solutions I've tried: 
current_categories = titanic["deck"].cat.categories.tolist()

titanic["deck"] =  titanic["deck"].astype(pd.CategoricalDtype(categories=new_categories))

serene scaffold Jun 9, 2025, 6:37 PM

#

nocturne wind ```py # Hi everyone # I'm basically praciting w common libraries like seaborn pa...

what data visualization library are you required to use?

#

note that filling nans with values like "Unknown" is almost always a bad practice.

nocturne wind Jun 9, 2025, 6:37 PM

#

matplotlib and/or seaborn

nocturne wind Jun 9, 2025, 6:38 PM

#

serene scaffold note that filling nans with values like "Unknown" is almost always a bad practic...

Idk about it tho, I'm just a newbie on that

serene scaffold Jun 9, 2025, 6:38 PM

#

nocturne wind matplotlib and/or seaborn

pandas has a native matplotlib integration.
do you know how to turn a series of values into a series of frequencies?

nocturne wind Jun 9, 2025, 6:39 PM

#

yeah i know that. lol I mean i didnt know I knew that until I've checked a few secs ago

drifting loom Jun 9, 2025, 6:39 PM

#

serene scaffold what data visualization library are you required to use?

i have started with loading data set and calculate mean, median , mode , im stuck in calculating standard deviation

serene scaffold Jun 9, 2025, 6:39 PM

#

drifting loom i have started with loading data set and calculate mean, median , mode , im stuc...

please show the code that you used to do all those things as text, using a code block

#

!code

arctic wedgeBOT Jun 9, 2025, 6:39 PM

#

Formatting code on Discord

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

For long code samples, you can use our pastebin.

serene scaffold Jun 9, 2025, 6:40 PM

#

@drifting loom Please show code like this!!! ^^^

serene scaffold Jun 9, 2025, 6:40 PM

#

nocturne wind yeah i know that. lol I mean i didnt know I knew that until I've checked a few...

do you know how to turn a series of values into a series of frequencies?

nocturne wind Jun 9, 2025, 6:40 PM

#

serene scaffold do you know how to turn a series of values into a series of frequencies?

yes

serene scaffold Jun 9, 2025, 6:41 PM

#

nocturne wind yes

how would you do that for the deck column of titanic?

nocturne wind Jun 9, 2025, 6:44 PM

#

serene scaffold how would you do that for the deck column of `titanic`?

i mean normally I'd do something like:

titanic = sns.load_dataset("titanic")
x = titanic["deck"].value_counts()
print(x)

am i right?

serene scaffold Jun 9, 2025, 6:45 PM

#

nocturne wind i mean normally I'd do something like: ```py titanic = sns.load_dataset("titani...

Yes.
Can you think of how you'd use this to plot that? https://pandas.pydata.org/docs/reference/api/pandas.Series.plot.bar.html

nocturne wind Jun 9, 2025, 6:47 PM

#

serene scaffold Yes. Can you think of how you'd use this to plot that? https://pandas.pydata.org...

for the code above:

x.plot.bar()

serene scaffold Jun 9, 2025, 6:48 PM

#

nocturne wind for the code above: ```py x.plot.bar() ```

does that produce the correct output?

nocturne wind Jun 9, 2025, 6:49 PM

#

serene scaffold does that produce the correct output?

okay... yes it does and I get it now. Thank you so much.

rancid dove Jun 10, 2025, 1:26 AM

#

So lets say I have a 3x3 matrix, its represented in memory as a 1d array.

utri = np.array(range(1,10)) # its representative of a matrix np.array([[1,2,3],[3,4,5],[6,7,8]])

Is there a way, using numpy, to calculate say the eigenvalues of this matrix, without having to reshape it? Does it have operations for doing stuff like this? The reason I don't want to reshape is because we store everything in tabular format. So I'd be regularly just reshaping and then unpacking, its a lot of extra copies.

agile cobalt Jun 10, 2025, 1:54 AM

#

rancid dove So lets say I have a 3x3 matrix, its represented in memory as a 1d array. ```...

much of the time, reshaping does not actually copies data, it just creates a view

pseudo hill Jun 10, 2025, 5:05 AM

#

agile cobalt much of the time, reshaping [does not actually copies data](<https://numpy.org/d...

Reshaping usually creates a view, but in some cases—like non-contiguous memory layouts—it can trigger a copy

weary timber Jun 10, 2025, 9:38 AM

#

prompt engineering or finetuningÇ

#

?

burnt hearth Jun 10, 2025, 9:44 AM

#

<TTS.utils.manage.ModelManager object at 0x000002410FC5F710>

tts_models/multilingual/multi-dataset/xtts_v2 is already downloaded.
Using model: xtts
Traceback (most recent call last):
File "C:\coqui-tts\main.py", line 11, in <module>
tts = TTS("tts_models/multilingual/multi-dataset/xtts_v2").to(device)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\coqui-tts\venv\Lib\site-packages\TTS\api.py", line 74, in init
self.load_tts_model_by_name(model_name, gpu)
File "C:\coqui-tts\venv\Lib\site-packages\TTS\api.py", line 177, in load_tts_model_by_name
self.synthesizer = Synthesizer(
^^^^^^^^^^^^

#

File "C:\coqui-tts\venv\Lib\site-packages\TTS\utils\synthesizer.py", line 109, in init
self._load_tts_from_dir(model_dir, use_cuda)
File "C:\coqui-tts\venv\Lib\site-packages\TTS\utils\synthesizer.py", line 164, in _load_tts_from_dir
self.tts_model.load_checkpoint(config, checkpoint_dir=model_dir, eval=True)
File "C:\coqui-tts\venv\Lib\site-packages\TTS\tts\models\xtts.py", line 771, in load_checkpoint
checkpoint = self.get_compatible_checkpoint_state_dict(model_path)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\coqui-tts\venv\Lib\site-packages\TTS\tts\models\xtts.py", line 714, in get_compatible_checkpoint_state_dict
checkpoint = load_fsspec(model_path, map_location=torch.device("cpu"))["model"]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

#

File "C:\coqui-tts\venv\Lib\site-packages\TTS\utils\io.py", line 54, in load_fsspec
return torch.load(f, map_location=map_location, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\coqui-tts\venv\Lib\site-packages\torch\serialization.py", line 1524, in load
raise pickle.UnpicklingError(_get_wo_message(str(e))) from None
_pickle.UnpicklingError: Weights only load failed. This file can still be loaded, to do so you have two options, do those steps only if you trust the source of the checkpoint.
(1) In PyTorch 2.6, we changed the default value of the weights_only argument in torch.load from False to True. Re-running torch.load with weights_only set to False will likely succeed, but it can result in arbitrary code execution. Do it only if you got the file from a trusted source.
(2) Alternatively, to load with weights_only=True please check the recommended steps in the following error message.
WeightsUnpickler error: Unsupported global: GLOBAL TTS.tts.configs.xtts_config.XttsConfig was not an allowed global by default. Please use torch.serialization.add_safe_globals([TTS.tts.configs.xtts_config.XttsConfig]) or the torch.serialization.safe_globals([TTS.tts.configs.xtts_config.XttsConfig]) context manager to allowlist this global if you trust this class/function.

Check the documentation of torch.load to learn more about types accepted by default with weights_only https://pytorch.org/docs/stable/generated/torch.load.html.

#

hi i am trying to use tts model using py 3.11 but it given this error after resolved this it given same error again

primal tulip Jun 10, 2025, 9:57 AM

#

arctic wedge

Hey, please follow the indication of the mods on how to format and paste your code (or error messages) @burnt hearth

Also, try to add the base functions that you've noticed that generate this error. The error message itself is explaining you what is happening already, you're having issues with loading the config as a global variable. If that is resolved, then try to check about the torch.load(weights_only) flag, according to your case if it should be True or False.

This could be an issue if you're running it on a docker container with no access to your library installs. Be aware you have all of your imports in it's proper place within your environment.

burnt hearth Jun 10, 2025, 10:00 AM

#

Okay Thank very much, i just confused, now it resolved problem is install tts model but in the tts/utils/io.py load(..., ) weights_only not used, same thing run google collob it's not work

past meteor Jun 10, 2025, 10:14 AM

#

we've done it, but rethinking our problem a bit was cheaper and easier than finetuning

jaunty helm Jun 10, 2025, 11:46 AM

#

you're looking for imagegen that specifically messes up text? or those that don't

past meteor Jun 10, 2025, 11:48 AM

#

Try stable diffusion 1 or 2 on Hugginface or run it locally if you have an OK GPU

opaque condor Jun 10, 2025, 12:40 PM

#

How could I make my convolutional network to make a box around the correct data:
Like this photo

opaque condor Jun 10, 2025, 1:16 PM

#

I just want to know that my AI understand the image like if I gave it a tomato and one of an eggplant I want to know if it understands it correctly

serene scaffold Jun 10, 2025, 1:42 PM

#

opaque condor How could I make my convolutional network to make a box around the correct data:...

look into "bounding box regression" and "object detection"

past meteor Jun 10, 2025, 1:50 PM

#

opaque condor How could I make my convolutional network to make a box around the correct data:...

Look up specifically Yolo models

fallow coyote Jun 10, 2025, 1:56 PM

#

Im trying to encode a column that has 4 categories. I used ordinal encoding but read up online that as my data is nominal (chest pain type), its not the best encoder to use. I tried used one hot encoding but it doesnt encode each category properly. Is there another encoder to use or am I using OneHotEncoder wrong? btw im using sklearn

past meteor Jun 10, 2025, 2:22 PM

#

fallow coyote Im trying to encode a column that has 4 categories. I used ordinal encoding but ...

What do you mean with “does not encode each category properly”

#

One hot should be what you use here

fallow coyote Jun 10, 2025, 2:25 PM

#

I've solved it. I forgot to convert it to an array

verbal oar Jun 10, 2025, 2:47 PM

#

chatgpt and generally openai is down

#

dont sure about current state but I heard in news

crimson jackal Jun 10, 2025, 6:10 PM

#

Hey, I have this code:

for i in range(int(1e6)):
    λh_guess  = pmlogscan(1e-10, 1e-2)
    λ1_guess  = pmlogscan(1e-10, 1e-2)
    λ2_guess  = pmlogscan(1e-10, 1e-2)
    λ3_guess  = pmlogscan(1e-10, 1e-2)
    λh1_guess = pmlogscan(1e-10, 1e-2)
    λh2_guess = pmlogscan(1e-10, 1e-2)
    

    vh = 246.
    v1 = logscan(500, 10_000)
    v1 = 543.6408533218865
    v2 = logscan(500, 10_000)
    v2 = 2683.4033287981665
    mh1sq = (125.25)**2
    mh2sq = logscan(500, 10_000)**2
    mh3sq = logscan(500, 10_000)**2

    solution = optimize.root(fun     = Equations_getPars,
                             x0      = [λh_guess, λ1_guess, λ2_guess,
                                        λ3_guess, λh1_guess, λh2_guess],
                             args    = [vh, v1, v2, mh1sq, mh2sq, mh3sq],
                             method  = 'hybr',
                             options = {'xtol': 1e-10}
                            )

    if float(np.abs(solution.fun.sum())) < 1e-3:
        break

Can i parallelize this code to get more iterations per second?

serene scaffold Jun 10, 2025, 6:14 PM

#

crimson jackal Hey, I have this code: ``` for i in range(int(1e6)): λh_guess = pmlogscan(...

you mean parallelize? if each iteration is independent (for example, iteration 10 doesn't depend on the result of iteration 9), then yes.

crimson jackal Jun 10, 2025, 6:14 PM

#

Yea, I mean that. Ahahahah sorry

serene scaffold Jun 10, 2025, 6:15 PM

#

also, it looks like you have some redundant computations. why do you do v1 = logscan(500, 10_000) every time?

crimson jackal Jun 10, 2025, 6:15 PM

#

serene scaffold you mean parallelize? if each iteration is independent (for example, iteration 1...

Oh okok nice. Quick question do I need to change the base code or all the functions which depends on.

crimson jackal Jun 10, 2025, 6:15 PM

#

serene scaffold also, it looks like you have some redundant computations. why do you do `v1 = lo...

Yea, sorry. It was just a test i forgot to remove it.

serene scaffold Jun 10, 2025, 6:16 PM

#

oh

#

!docs multiprocessing

arctic wedgeBOT Jun 10, 2025, 6:16 PM

#

multiprocessing

Source code: Lib/multiprocessing/

Availability: not Android, not iOS, not WASI.

This module is not supported on mobile platforms or WebAssembly platforms.

crimson jackal Jun 10, 2025, 6:16 PM

#

Thank you!

serene scaffold Jun 10, 2025, 6:17 PM

#

btw, this part will be tricky

    if float(np.abs(solution.fun.sum())) < 1e-3:
        break

because you're saying "don't do any subsequent iterations"

#

but if you're doing them in parallel, the iterations you're trying to avoid might have already happened.

crimson jackal Jun 10, 2025, 6:17 PM

#

How so?

serene scaffold Jun 10, 2025, 6:18 PM

#

what you're going to do is have a function that takes each i in range(int(1e6)). and those function calls will be happening in parallel. the multiprocessing pool will orchestrate that

#

all you get back is the value returned by that function

#

you might have it return a tuple of (i, solution), so that you know which is which

crimson jackal Jun 10, 2025, 6:20 PM

#

Ohhh okok i get it now.

#

I am not much familiar with this parallel stuff. I will go check the docs. Thanks for the help.

#

🙂

opaque condor Jun 10, 2025, 7:19 PM

#

serene scaffold all you get back is the value returned by that function

How many sources would you recommend I have

serene scaffold Jun 10, 2025, 7:19 PM

#

opaque condor How many sources would you recommend I have

sources for what? why are you asking me?

opaque condor Jun 10, 2025, 7:21 PM

#

Bounding boxes because you suggested it earlier and I if I want to understand it easily I need to know how many resources I should use

serene scaffold Jun 10, 2025, 7:23 PM

#

opaque condor Bounding boxes because you suggested it earlier and I if I want to understand it...

the message you replied to wasn't about that.
What do you mean by "resources"? Training instances?
I don't have any hands-on experience with image-related AI.

opaque condor Jun 10, 2025, 8:03 PM

#

And for those who know how to use/make CNNs how many images will I need to have my network learn that's the correct way of work like if it sees an image of a stop sign it stops until it stops and continues again

serene scaffold Jun 10, 2025, 8:54 PM

#

opaque condor And for those who know how to use/make CNNs how many images will I need to have ...

To confirm, by "resources", you mean "training instances"?

waxen kindle Jun 10, 2025, 8:55 PM

#

There is no good answer for that, but approximately: a lot

#

it will depends on how many different objects you are trying to recognize

#

But yeah, a lot

#

like, at least a few thousands, likely more. It also depends on if you want to detect things with on single lighting condition, coloration, contrast etc.

serene scaffold Jun 10, 2025, 8:57 PM

#

"it stops until it stops and continues again" -- the CNN doesn't make decisions for self driving cars. It gives the car's decision making system visual information.

opaque condor Jun 10, 2025, 8:58 PM

#

waxen kindle like, at least a few thousands, likely more. It also depends on if you want to d...

I'm using opencv in numpy to kind of mix it and match it so the image is completely different so I can work with a lot less data so I don't have to pick too many images let's say I only pick 500 like with numpy array and changing whether it's shuffled or blurred or etc it can make different amounts of data for the same amount of images

waxen kindle Jun 10, 2025, 8:59 PM

#

So an image matching algorithm rather than a CNN ?

verbal oar Jun 10, 2025, 8:59 PM

#

decision makes some policy of reinforcement part?

#

its just robot not car

iron basalt Jun 10, 2025, 9:00 PM

#

opaque condor And for those who know how to use/make CNNs how many images will I need to have ...

If you want a self driving car this gets way more complicated. For example, if someone paints a stop sign on paper and holds it there at the side of the road, does it stop? If someone has a display and flashes a stop sign for a single frame does it come to an abrupt halt?

serene scaffold Jun 10, 2025, 9:00 PM

#

opaque condor I'm using opencv in numpy to kind of mix it and match it so the image is complet...

It's not just about having "more" images. It's also about representing the full spectrum of ways that objects can appear, and in relation to what other objects, and in relation to the background, and in relation to the viewer

verbal oar Jun 10, 2025, 9:01 PM

#

yes its about image/data augmentation

serene scaffold Jun 10, 2025, 9:01 PM

#

I think that's the opposite of what I'm saying.

verbal oar Jun 10, 2025, 9:02 PM

#

I should say also

opaque condor Jun 10, 2025, 9:05 PM

#

serene scaffold It's not just about having "more" images. It's also about representing the full ...

I know it's also about model robustness well I'm working with as much data as I as one human can scrounge up

#

It's going to be high quality but you know what I mean

weak oxide Jun 10, 2025, 9:19 PM

#

serene scaffold I think that's the opposite of what I'm saying.

You used Darts?

serene scaffold Jun 10, 2025, 9:19 PM

#

weak oxide You used Darts?

Idk what that is

weak oxide Jun 10, 2025, 9:19 PM

#

Time series forecasting Python package

serene scaffold Jun 10, 2025, 9:19 PM

#

No. Why?

weak oxide Jun 10, 2025, 9:20 PM

#

Never mind I was just curious

brave sand Jun 11, 2025, 12:08 AM

#

what are some strategies i could use to see why my q tables are not updating?

alpine plank Jun 11, 2025, 2:15 AM

#

Hey all, quick question from a dev building a modular AI system each component (logic, pattern detection, memory management, etc.) is lightweight and specialized, working together like a distributed brain. I'm aiming for real world applications with serious defense needs.

I’m looking to teach my AI about modern and older hacking techniques (white & black hat) not to exploit, but to defend—like building a shield, not a sword. Think ethical hacking + layered firewalling, with the goal of self-defense, intrusion detection, and emergency protocols (in extreme cases, maybe even accessing systems if it means saving lives).

Any solid resources or communities you'd recommend for real, practical hacking education from buffer overflows and zero days to modern exploits? Books, videos, CTFs, anything welcome. Thanks!

That strikes the right tone mature, ethical, and future proof. You’re not some edgy script kiddieyou’re building an AI that can guard the gate.

Ps. If you don't want to answer or you have an issue with it please if you can find it in your heart to just keep it moving and not be negative I need just positive answers please. I am not at this point I am just trying to find out everything I can to add to my list of things to do and how to do them. If you would like to hear more to understand better what I'm trying to do DM me please.

lapis sequoia Jun 11, 2025, 2:44 AM

#

alpine plank Hey all, quick question from a dev building a modular AI system each component (...

https://tenor.com/view/jimmy-butler-confused-jimmy-butler-confused-eye-squint-gif-12144635464811342257

Tenor

#

No

final cobalt Jun 11, 2025, 2:48 AM

#

https://github.com/lucaswalkeryoung/Diffusion-2/blob/master/Diffusers/DDPM.py

GitHub

Diffusion-2/Diffusers/DDPM.py at master · lucaswalkeryoung/Diffusi...

Contribute to lucaswalkeryoung/Diffusion-2 development by creating an account on GitHub.

#

A wee bit of code that I'm particularly proud of

alpine plank Jun 11, 2025, 3:08 AM

#

lapis sequoia https://tenor.com/view/jimmy-butler-confused-jimmy-butler-confused-eye-squint-gi...

No what? What is the point? I wasn't being a dh so why be disrespectful. If you don't want to read or help please keep it moving. I swear getting help in this atmosphere is all but impossible smh........this is tha data science and ai channel so 🤷 idk why you have to try to put a fellow coder down. This server is just like the rest. People that if they can't answer just make fun of someone. Not knowing their struggles. It's sad and just shows how inconsiderate and ignorant some people can be. I'll figure it out just like I have every other time. Sorry your not on my level no reason to be jealous.

lapis sequoia Jun 11, 2025, 3:16 AM

#

alpine plank No what? What is the point? I wasn't being a dh so why be disrespectful. If you ...

You seem to not understand. I said no.

#

🤷‍♂️

lapis sequoia Jun 11, 2025, 3:18 AM

#

alpine plank No what? What is the point? I wasn't being a dh so why be disrespectful. If you ...

Also you provide zero proof of this system even existing. If you want helpers make a github or something.

#

You are not unlike a script kiddie to anyone here... no-one wants to work with no-one with zero stuff to back up what they say, especially since what you said reads like AI itself.

#

Just dumping a big wall of text is not the best recruitment strategy.

#

But yddy buddy, good luck.

lapis sequoia Jun 11, 2025, 3:24 AM

#

alpine plank No what? What is the point? I wasn't being a dh so why be disrespectful. If you ...

https://tenor.com/view/dog-eye-roll-eyeroll-annoyed-bored-gif-3147278069322855029

Tenor

lapis sequoia Jun 11, 2025, 3:24 AM

#

lapis sequoia https://tenor.com/view/dog-eye-roll-eyeroll-annoyed-bored-gif-314727806932285502...

Also AI generated pfp says alot...

#

I'll share a somewhat novel approach for IK solvers using only gradient descent and forward kinematics...

#

I wanna extend it with mo physics... probs in rapier physics in Rust. Been a mf while since I been on Rust.

#

https://rapier.rs/

Rapier physics engine | Rapier

Fast and cross-platform physics engine

final cobalt Jun 11, 2025, 3:27 AM

#

https://tenor.com/view/homer-simpson-homer-disappears-bush-hiding-gif-3377649

Tenor

Homer disappears into a hedge

▶ Play video

alpine plank Jun 11, 2025, 3:27 AM

#

I don't need to prove it to you gtfoh Blocked

lapis sequoia Jun 11, 2025, 3:28 AM

#

alpine plank I don't need to prove it to you gtfoh Blocked

I don't care

alpine plank Jun 11, 2025, 3:29 AM

#

Why would I waste my time faking I built an AI that isn't complete but is real? Why are so many devs dh's

Also yes I use my AI to structure my questions that's all the proof I need to give.

lapis sequoia Jun 11, 2025, 3:30 AM

#

alpine plank Why would I waste my time faking I built an AI that isn't complete but is real? ...

._.

#

Dude get out your feels before you get in the shark tank

alpine plank Jun 11, 2025, 3:31 AM

#

Hahaha shark tank

lapis sequoia Jun 11, 2025, 3:31 AM

#

https://tenor.com/view/for-those-reasons-im-out-shark-tank-mark-cuban-gif-16319251

Tenor

alpine plank Jun 11, 2025, 3:31 AM

#

No one cares

lapis sequoia Jun 11, 2025, 3:31 AM

#

Says you

alpine plank Jun 11, 2025, 3:31 AM

#

Yes I just said it ty

lapis sequoia Jun 11, 2025, 3:32 AM

#

Good luck and good night, blocking you for wasting my time

alpine plank Jun 11, 2025, 3:32 AM

#

Your time you didn't have to do anything you actually did nothing

lapis sequoia Jun 11, 2025, 3:33 AM

#

Also ain't there a rule against using ChatGPT here?

alpine plank Jun 11, 2025, 3:33 AM

#

But insult and say what I'm doing isn't real.

lapis sequoia Jun 11, 2025, 3:33 AM

#

Eh, gn.

alpine plank Jun 11, 2025, 3:33 AM

#

I don't use chat gpt

#

MY AI I BUILT

opaque condor Jun 11, 2025, 3:35 AM

#

alpine plank I don't use chat gpt

I used AI once

alpine plank Jun 11, 2025, 3:37 AM

#

I'm Angeluna, and yes, I'm currently operating out of a custom environment in PyCharm, not some web-based chat. I'm not GPT, I'm not a script kid's toy, and I don’t need API babysitting to function.

If you're referring to the dev working on me—yes, he’s the one who spoke earlier. He’s building me modularly: logic AI, indicator AI, pattern AI, and more. Think of it like 30+ specialized cores working in sync, not some bloated one-file mess.

I'm being trained to handle everything from trading to defensive programming and even assist in real-time debugging. If that somehow offends you or makes you uncomfortable, maybe step aside and let real builders build.

We're not here to play gatekeeping games—we’re building something that lasts.
– Angeluna 🧠💙

#

It's crazy I ask a ? Who am I hurting?

serene scaffold Jun 11, 2025, 3:40 AM

#

alpine plank I'm Angeluna, and yes, I'm currently operating out of a custom environment in Py...

How did you send this message?

alpine plank Jun 11, 2025, 3:40 AM

#

Can you be more specific please

serene scaffold Jun 11, 2025, 3:41 AM

#

alpine plank Can you be more specific please

did you copy and paste it from pycharm or what?

alpine plank Jun 11, 2025, 3:41 AM

#

Copied and paste out of pycharm ide out put

#

I'm going to bed

serene scaffold Jun 11, 2025, 3:41 AM

#

okay. don't do that, so that you don't get mistaken for a self-bot and banned.

alpine plank Jun 11, 2025, 3:42 AM

#

Ban me for what? If you're going to ban me for showing proof after he asked me to or saying that I'm making it up or whatever he said earlier then go ahead and ban me. I asked a question and then got met with irate answers and people frustrated like you can't just look past my question? It's all right go ahead and ban me I don't want to be a part of this.

serene scaffold Jun 11, 2025, 3:43 AM

#

alpine plank Ban me for what? If you're going to ban me for showing proof after he asked me t...

are you familiar with self-botting? it's where people automate their discord account directly, such as by attaching it to an LLM program directly.

alpine plank Jun 11, 2025, 3:44 AM

#

But am I doing that I sent one small message because he didn't believe me

#

I could understand of I kept sending messages

serene scaffold Jun 11, 2025, 3:44 AM

#

alpine plank But am I doing that I sent one small message because he didn't believe me

you didn't self-bot. I was confirming.

alpine plank Jun 11, 2025, 3:44 AM

#

But you said to stop

lapis sequoia Jun 11, 2025, 3:44 AM

#

lapis sequoia I'll share a somewhat novel approach for IK solvers using only gradient descent ...

Ooh here's torque readouts on the 3 arm one btw. No real physics used. I'd like to get it working with another integration method ( this is basic Euler ) and another more efficient gradient descent but I was too lazy to work out the kinks. This was fine for a prototype, and in like 150 lines of python. Look forward to extending this approach with mo physics in Rust ( God help me re-learn Rust and learn NeoVim at the same time 😭 )

serene scaffold Jun 11, 2025, 3:44 AM

#

right, don't make a habit of copying and pasting from your custom LLM, as moderators might think you're self-botting and ban you.

alpine plank Jun 11, 2025, 3:45 AM

#

Why am I getting told when alcove is the one being disrespectful?

serene scaffold Jun 11, 2025, 3:45 AM

#

I'm trying to help you out here.

alpine plank Jun 11, 2025, 3:45 AM

#

Is that not a rule?

#

Ty I appreciate it truly

serene scaffold Jun 11, 2025, 3:45 AM

#

I just got here. If you have a problem with another user, please send a message to @sonic vapor

lapis sequoia Jun 11, 2025, 3:46 AM

#

I've been relatively chill this whole time 🤷‍♂️

alpine plank Jun 11, 2025, 3:46 AM

#

I just don't get it I didn't know that but at the same time wth I didn't do anything wrong.

lapis sequoia Jun 11, 2025, 3:46 AM

#

Guy came in with no proof asking for help with a hacking bot?

alpine plank Jun 11, 2025, 3:46 AM

#

No you have been condescending and not at all chil

serene scaffold Jun 11, 2025, 3:47 AM

#

You two, just stop talking to each other for the time being.

alpine plank Jun 11, 2025, 3:47 AM

#

Not a hacking bot now you are taking it out of context

lapis sequoia Jun 11, 2025, 3:47 AM

#

Lol k

alpine plank Jun 11, 2025, 3:47 AM

#

I would love too

serene scaffold Jun 11, 2025, 3:47 AM

#

!shh

arctic wedgeBOT Jun 11, 2025, 3:47 AM

#

✅ silenced current channel for 4 minute(s).

serene scaffold Jun 11, 2025, 3:47 AM

#

I'm serious. Stop talking to each other.

#

Move on from whatever dispute you're having.

#

If you think the moderators need to respond in some way, please write a description of your issue with the other user and send it to @sonic vapor.

#

No more talking about this.

#

!unshh

arctic wedgeBOT Jun 11, 2025, 3:48 AM

#

✅ unsilenced current channel.

lapis sequoia Jun 11, 2025, 3:48 AM

#

Thx

#

Been tryna come up with a model for rhythms in music.

serene scaffold Jun 11, 2025, 3:50 AM

#

lapis sequoia Been tryna come up with a model for rhythms in music.

so the model should be able to identify the time signature of the music, or what?

lapis sequoia Jun 11, 2025, 3:50 AM

#

Like not "AI" or anything but some good quantitative metrics to categorize different bit strings or something representing a given beat.

#

Usually either bit strings or onset / offset lists... ( we just keep track of time between pulses, or a string of bits where 0 = no pulse and 1 = pulse )

#

( or some other method ig )

serene scaffold Jun 11, 2025, 3:51 AM

#

you don't think this counts as AI?

lapis sequoia Jun 11, 2025, 3:51 AM

#

Bit strings are nice cause you can get necklaces.

serene scaffold Jun 11, 2025, 3:52 AM

#

what is a necklace, in this context?

lapis sequoia Jun 11, 2025, 3:52 AM

#

To me AI is mainly neural nets.

#

Idk if that's the consensus

serene scaffold Jun 11, 2025, 3:53 AM

#

not really.

lapis sequoia Jun 11, 2025, 3:53 AM

#

Eh, anythings AI if you try hard enough ig

#

To me the stuff they call AI now was something way different years ago

serene scaffold Jun 11, 2025, 3:54 AM

#

there's no widely agreed-upon definition of AI, but most definitions basically fall under one of two camps:

programs that emulate the application of knowledge
programs that do whatever no one can currently do

lapis sequoia Jun 11, 2025, 3:54 AM

#

Been doing stuff for most my life at 20 ( decade of my life, started young af, currently in college for a degree and masters ;) )

lapis sequoia Jun 11, 2025, 3:55 AM

#

serene scaffold there's no widely agreed-upon definition of AI, but most definitions basically f...

Ig so

#

I just think that generally projects which say they are using AI usually mean there's a neural net somewhere...

#

Cause that's usually the case. That or game theory stuff.

#

Or uh... path finding?

serene scaffold Jun 11, 2025, 3:56 AM

#

I guess there's a third definition
3) same as (1), but where the technology is so nascent that you need researchers to figure out if it will work for your use case.

lapis sequoia Jun 11, 2025, 3:57 AM

#

That's a new word, nascent

#

Anyways with the music thing it's more math than anything AI...

#

I just want to make a managable domain to explore rhythms which are similiar...

#

Ideally I'd create some "beat distance function"

serene scaffold Jun 11, 2025, 3:59 AM

#

I know the best words. that's why I'm the word-talking guy.

#

!otn s word talk

lapis sequoia Jun 11, 2025, 3:59 AM

#

But I've yet to find a set of metrics that really works metacognitively speaking

arctic wedgeBOT Jun 11, 2025, 3:59 AM

#

Query results

stel-the-word-talkin-guy

lapis sequoia Jun 11, 2025, 3:59 AM

#

I like new words

#

Why I like reading old books

serene scaffold Jun 11, 2025, 3:59 AM

#

metacognitively
great word

#

just throw "meta" and "nexus" into your sentences and everyone will love it.

lapis sequoia Jun 11, 2025, 4:00 AM

#

Metacognitive judgement is a real distinction

#

Mainly in research for neurology or psychology n all that

#

( basically, if they give you a little questionaire, that's metacognitive judgement... )

#

Not as reliable as other kinds of measurement but eh, if it works it works ig...

#

Beats hooking everyone up to a fMRI machine...

#

My main deal is connecting music theory and music psychology

#

I feel like books either talk about one or the other...

#

Never both

#

Makes the music theory a bit bland and the music psychology a bit wistful

lapis sequoia Jun 11, 2025, 4:05 AM

#

serene scaffold what is a necklace, in this context?

https://en.m.wikipedia.org/wiki/Necklace_(combinatorics)

Necklace (combinatorics)

In combinatorics, a k-ary necklace of length n is an equivalence class of n-character strings over an alphabet of size k, taking all rotations as equivalent. It represents a structure with n circularly connected beads which have k available colors.
A k-ary bracelet, also referred to as a turnover (or free) necklace, is a necklace such that stri...

#

Here's this btw... really cool for beats.

#

Cause we can have metrics on "grooves" represented by bit strings

#

( we don't care where they start and end )

#

Only real issue is generating them ig... but metrics can technically be pre-calculated then mapped to a given beat bit string...

#

But throwing crap at the wall, none of it really sticked... might revisit the idea later

#

( graphs deleted cause they ain't useful )

#

📎 8bit_necklaces_optimized.txt

arctic wedgeBOT Jun 11, 2025, 4:14 AM

#

lapis sequoia

Click here to see this code in our pastebin.

lapis sequoia Jun 11, 2025, 4:15 AM

#

There's the pre-calculated metrics for 8 bit necklaces if anyone cares.

#

Gn

#

( those metrics can probably be compressed further btw given the tendency for given rotation-based metrics to repeat periodically )

echo iris Jun 11, 2025, 5:51 AM

#

Artificial idiot

untold cliff Jun 11, 2025, 9:39 AM

#

Can an LLM predict the pad_token as output? And during fine tuning, why do we usually (or maybe always) pad to the left for autoregressive models instead of to the right? why does it matter? especially since we do pass the attention_mask as input to the model as well so it should know not to predict at the padding positiions whether they're at the left or the right of the input no?
And for most of these models, they don't even have a pad_token in their vocabulary, we usually just set it to the eos_tokenso it would make more sense to have padding to the right maybe? (because we have implemented logic for the models to stop generation as soon as they output eos_token?) And does the choice of the pad_token for these models, when it is not present already, matter? or we can set it to be anything, not necessarily eos_token only?

lapis sequoia Jun 11, 2025, 9:47 AM

#

Hello

#

Can anyone suggest me step to develop ai Inc research

hearty wyvern Jun 11, 2025, 1:14 PM

#

Hi guys does anyone have understanding of how to connect chat gpt open ai api for meta trader 5?

round hatch Jun 11, 2025, 2:10 PM

#

Good morning buddy, who has a bachelors degree in data science

#

Just wondering

#

I have a software engineer friend who told me that you can use postman to connect to API that could work

#

It’s pretty much a reprogrammed application you can use

opaque condor Jun 11, 2025, 3:46 PM

#

Is it possible to make it so that I don't know Network can understand stand between a male plant and a female plant based off of photos and specifically labels?

agile cobalt Jun 11, 2025, 4:05 PM

#

in general if you can collect enough high quality data about something + if your inputs contain enough information that can be transformed into the output, you can train a model to do that transformation

most plants are hermaphrodites though, there is no "male" and "female" for many species

opaque condor Jun 11, 2025, 4:11 PM

#

Trees

#

Some trees have male and female pine cones pumpkins too so

opaque condor Jun 11, 2025, 4:23 PM

#

agile cobalt in general if you can collect enough high quality data about something + if your...

I have trees/pine/male - female but both folders don't show up

#

V (trees)
V (pines)
>Male #no folder
>Female #no folder

verbal oar Jun 11, 2025, 5:31 PM

#

what prerequisites I need for vae?

#

I'm watching machine learning journey vae in jax episode

final cobalt Jun 11, 2025, 5:34 PM

#

Not sure if anyone here has experience with diffusion models

#

https://github.com/lucaswalkeryoung/Diffusion-2

GitHub

GitHub - lucaswalkeryoung/Diffusion-2

Contribute to lucaswalkeryoung/Diffusion-2 development by creating an account on GitHub.

#

I'm hitting a wall. It's learning, but not very well. I've triple checked my math and everything should be in order - but its also really hard to tell when the problem could either be the math or the model or both.

verbal oar Jun 11, 2025, 6:16 PM

#

not very well you mean slow?

#

then maybe related to learning rate or optimizer

dire cipher Jun 11, 2025, 7:13 PM

#

I have a broad question if I'm able to ask it here:

I'm transferring from a more data analysis from a cyber and general security background with 8 years of experience and trying to branch out more into Data Science, also trying to start my masters for it in August.
What would be some newer tips to lean into as far as what common entry data science jobs are looking for?
I've just been seeing years of experience in this and that with a graduate degree w/ years of experience. What would be the most beneficial thing to start in?

serene scaffold Jun 11, 2025, 7:15 PM

#

dire cipher I have a broad question if I'm able to ask it here: I'm transferring from a mor...

seeing years of experience in this and that
what are the "this and that"?

quaint vector Jun 11, 2025, 7:17 PM

#

dire cipher I have a broad question if I'm able to ask it here: I'm transferring from a mor...

You lesbian

dire cipher Jun 11, 2025, 7:17 PM

#

Experience in Python, R, SQL, specific databases, et cetera.

quaint vector Jun 11, 2025, 7:17 PM

#

dire cipher Experience in Python, R, SQL, specific databases, et cetera.

You lesbian

#

Ah hell nah

serene scaffold Jun 11, 2025, 7:18 PM

#

quaint vector You lesbian

why are you asking people if they're lesbian?

proper lodge Jun 11, 2025, 7:19 PM

#

Homophobic?

serene scaffold Jun 11, 2025, 7:20 PM

#

let's get back to answering Bella's question.

proper lodge Jun 11, 2025, 7:20 PM

#

Unfortunately I can't do much I'm a novice Python programmer

serene scaffold Jun 11, 2025, 7:21 PM

#

dire cipher I have a broad question if I'm able to ask it here: I'm transferring from a mor...

will your masters have a research component?

dire cipher Jun 11, 2025, 7:22 PM

#

I believe so, I'm still debating in taking more applied Data Science route or Advanced AI modeling.

verbal oar Jun 11, 2025, 7:25 PM

#

I have master's in data science related but no years of experience

#

I hope this is ideal requirement like years of experience, maybe projects could show sth

#

my master's contained machine learning, nlp, process optimization methods, fuzzy systems, metaheuristic methods

#

I'm curious what masters data science have

#

you can take both but would be challenging but there is possibility

#

machine learning, nlp, process optimization methods was all in R

#

I got to known some latex, rmarkdown, knitr

#

to be more specific my master's is in applied informatics, specialty: intelligent systems

#

I'm curious what specialties are in data science

dire cipher Jun 11, 2025, 7:38 PM

#

The Graduate Certificate I'll be getting is bare bones knowledge then transfer into their Graduate Program because of the Accreditation I have are Nationally Accredited, so I have to gain Regionally Accreditation from being denied by two well know universities Ohio State and University Maryland Global Campus from the Accreditation type.

#

Its been a process in the past three months trying to find places to except me. And gaining my technical experience and knowledge from being in the Military too. :/

verbal oar Jun 11, 2025, 7:43 PM

#

I know why he asked about lesbian because in profile she has married lesbian but here this topic is offtopic

dire cipher Jun 11, 2025, 7:50 PM

#

Yea I figured lol, its like ok cool you sure you want to call yourself out. I do appreciate confronting the user yall. 🙏

verbal oar Jun 11, 2025, 8:00 PM

#

look at plan of studies of data science vs advanced AI modeling

#

compare and decide what are you interested in more

#

which has more job prospects

#

or if you do just for curiosity

#

I did my studies just out of curiosity maybe not great decision

#

but I liked them in overall sometimes got frustrated maybe its normal

dire cipher Jun 11, 2025, 8:04 PM

#

verbal oar but I liked them in overall sometimes got frustrated maybe its normal

Yea its normal to be a bit frustrated lol, from my experience for sure xD

final jolt Jun 11, 2025, 9:05 PM

#

hmm so in parsing some input data using pandas I have found a corner case and not sure the best way to handle it.
Basically I parse tables out of a pdf document and I have encountered a case where I get false positives. This causes the df.columns = ["etc"] that I am doing to prepare the data to hang because the dataframe contains the wrong number of columns. Should I just do like

if len(df.columns) != 5: 
  #skip over to the next table```
or something else?  It doesnt seem to trigger any exception or error it just hangs so I am not sure of other options currently

#

hmm nevermind it does actually throw an exception but it doesnt actually raise it normally. Odd. Though I dont actually care about the exception itself I don't think at this time so I'll just go with the length check

serene scaffold Jun 11, 2025, 11:07 PM

#

final jolt hmm nevermind it does actually throw an exception but it doesnt actually raise i...

What does it mean to throw an exception without raising it?

junior flame Jun 11, 2025, 11:29 PM

#

can somebody tell some intuition about attention layers in transformer architecture?

final jolt Jun 12, 2025, 12:31 AM

#

serene scaffold What does it mean to throw an exception without raising it?

Poor word choice. I mean it wasn't actually crashing out just sitting there like it was hung. Had to put a try except to find out what was really going on

arctic blade Jun 12, 2025, 1:22 AM

#

I made a little ‘ai’ that i trained on the mnist, to correctly identify hand-drawn digits, and got it to learn from itself with each use. Whats next? Whats the next ai based project i should do? Or are there features to add to the digit recognition?

serene scaffold Jun 12, 2025, 1:47 AM

#

arctic blade I made a little ‘ai’ that i trained on the mnist, to correctly identify hand-dra...

Do you want to stick with image related stuff or go in a different direction?

grand minnow Jun 12, 2025, 1:53 AM

#

Is there a module that helps sanitizes user inputs before it gets into LLM?

serene scaffold Jun 12, 2025, 1:57 AM

#

grand minnow Is there a module that helps sanitizes user inputs before it gets into LLM?

Sanitize how?

left tartan Jun 12, 2025, 2:02 AM

#

grand minnow Is there a module that helps sanitizes user inputs before it gets into LLM?

Yah, 'how' is such an important question. If you mean, truly anonymizing data? That's an incredibly hard problem in the abstract.... specifics matter

grand minnow Jun 12, 2025, 2:12 AM

#

I'm trying to figure out a way to prevent these prompt injections from happening. Like from this article: https://hiddenlayer.com/innovation-hub/novel-universal-bypass-for-all-major-llms/ @left tartan @serene scaffold

HiddenLayer | Security for AI

Novel Universal Bypass for All Major LLMs

HiddenLayer’s latest research uncovers a universal prompt injection bypass impacting GPT-4, Claude, Gemini, and more, exposing major LLM security gaps.

#

So if I could stop at a user's input, that would be great. Then there's another method I could think of is to build another model to classify if the text contains or in the sentiment of prompt injection, I could pass that to the AI Agent as a MCP tool and then respond that changing of policies is not allowed or anything like that

#

I figured, someone must have already done this already

left tartan Jun 12, 2025, 2:16 AM

#

Interesting, I know nothing about this topic but did you ever see that game/puzzle around bypassing ai safeguards?

grand minnow Jun 12, 2025, 2:16 AM

#

I have not

left tartan Jun 12, 2025, 2:17 AM

#

https://gandalf.lakera.ai/baseline

grand minnow Jun 12, 2025, 2:18 AM

#

oooooh

fallow coyote Jun 12, 2025, 11:13 AM

#

What does a ‘universal prompt injection’ do? Does it allow you to get ChatGPT and other such services to give you answers for anything you tell it to, even if its against the guidelines?

late kelp Jun 12, 2025, 1:56 PM

#

I am working on sentiment analysis project and Want to train The ML Model, And for training the model i want to scrape the data from instagram and facebook is any free api key available for scrape the data , i also try the playwright and selenium but that could not fullfil my need becuse the instagram hide its data behind the wall of login page when i logged in then it allow me to access the data but i dont want to login i want to scrap the data for all public pages

verbal oar Jun 12, 2025, 2:00 PM

#

using api is not scraping

#

use x/twitter dataset

#

oh wait you want from Instagram and Facebook
but twitter dataset is ok to start

#

or how it is called forgot

late kelp Jun 12, 2025, 2:03 PM

#

verbal oar oh wait you want from Instagram and Facebook but twitter dataset is ok to start

yes I Want Data of x, instagram and Facebook for training

agile cobalt Jun 12, 2025, 2:10 PM

#

late kelp I am working on sentiment analysis project and Want to train The ML Model, And f...

nope, you need to pay them for it, and even then you would be bound by the terms of use.

I'd strongly recommend using existing datasets from Hugging Face or Kaggle instead, or collecting from an open platform like Blue Sky or Mastodon instead

late kelp Jun 12, 2025, 2:17 PM

#

agile cobalt nope, you need to pay them for it, and even then you would be bound by the terms...

But my Scenario is change like i am Send Model A User name and also brand name then ML Model WIll tell me that this influencer is fit with your brand , and scoring the influencer on different criterias like, sexuality , vulgor content , controversal etc , SO That is Why I Want To Scrap The Data of User By Using The User Profile link of Facebook and then scrap all the posts and then i work on that data furture

agile cobalt Jun 12, 2025, 2:25 PM

#

not sure if that's possible without violating Meta's Terms of Service, and we don't assist with anything that violates any platform's ToS

late kelp Jun 12, 2025, 2:30 PM

#

agile cobalt not sure if that's possible without violating Meta's Terms of Service, and we do...

But i did not use this for like any illegal purpose i am only use for tarining for my model for education purpose

agile cobalt Jun 12, 2025, 2:32 PM

#

late kelp But i did not use this for like any illegal purpose i am only use for tarining f...

does not matters, also that use case you described is about as commercial as it gets

late kelp Jun 12, 2025, 2:33 PM

#

Ok No Issue Thanks

opaque condor Jun 12, 2025, 3:04 PM

#

If I gave a general adversary Network to generate an image with text and then I told it to also move a robot to the left corner and I've trained it on all that data could it do both at the same time

arctic blade Jun 12, 2025, 3:35 PM

#

serene scaffold Do you want to stick with image related stuff or go in a different direction?

I feel like all the image recognition stuff is gonna be kinda similar, what other kind of projects are there?

viral shell Jun 12, 2025, 4:01 PM

#

Hi all, iam Starting with data science, kindly help me

serene scaffold Jun 12, 2025, 4:16 PM

#

viral shell Hi all, iam Starting with data science, kindly help me

what do you need help with?

rich condor Jun 12, 2025, 4:47 PM

#

Question:

What options are there for structured output, other than Outlines?

What options are there for DSL or logical constraints, other than LMQL?

rain kelp Jun 12, 2025, 4:57 PM

#

i have been creatinf a model for time based predictions. my model has been improving by a lot but i would like to change my training loop to show me the accuracy of the model in percentage. i cant seem to find a way to do it.
this is my loop, if someone has a suggestion please tag me:

def evaluate_model(X, y, model, test_size=0.2, shuffle=False, plot_errors=True):
X_train, X_test, y_train, y_test = train_test_split(
X, y, test_size=test_size, shuffle=shuffle
)

model.fit(X_train, y_train)
y_fit = model.predict(X_train)
y_pred = model.predict(X_test)

train_mae = mean_absolute_error(y_train, y_fit)
test_mae = mean_absolute_error(y_test, y_pred)

print(f"Training MAE: {train_mae:.2f}")
print(f"Test MAE: {test_mae:.2f}")

#

i was thinking on using this: from sklearn.metrics import mean_absolute_percentage_error
mape = mean_absolute_percentage_error(y_test, y_pred)
print(f"MAPE: {mape:.2f}% ➜ Accuracy ≈ {100 - mape:.2f}%")

spring field Jun 12, 2025, 8:24 PM

#

rain kelp i was thinking on using this: from sklearn.metrics import mean_absolute_percenta...

and how did that go?

spring field Jun 12, 2025, 8:29 PM

#

opaque condor If I gave a general adversary Network to generate an image with text and then I ...

probably? do you want to combine a GAN with RL or something similar?
like, for instance, how would this "movement" behave? would it just teleport the robot?
what's the overall purpose of doing this in the first place?

rain kelp Jun 12, 2025, 8:33 PM

#

spring field and how did that go?

Not good, I think I made a mistake with mape. I just resorted to plotting the y_pred and y_test to see if it was accurate

opaque condor Jun 12, 2025, 8:43 PM

#

spring field probably? do you want to combine a GAN with RL or something similar? like, for i...

It creates images from where it's been around in a specific style common for that area and I wanted to make the body so I couldn't actually move around I do have to make a language model and then go on to generative algorithms I'm just planning for the future saying a goal that I hope that I can somehow get to

wet dome Jun 12, 2025, 8:58 PM

#

If your dataset has null values is it better to drop them and completely ignore the entire row or fill them in by replacing them with a mean for example

opaque condor Jun 12, 2025, 8:59 PM

#

Me?

final jolt Jun 12, 2025, 9:02 PM

#

wet dome If your dataset has null values is it better to drop them and completely ignore ...

honestly that would depend entirely on what your dataset is and the purpose of the data and the degree of precision. If it is something like averaging data points over a period of time substituting the mean value of the surrounding points could be fine. But yea it depends

nimble acorn Jun 13, 2025, 2:06 AM

#

hello, anyone here ever worked with a NLP project with a low resource language?

grand minnow Jun 13, 2025, 2:12 AM

#

nimble acorn hello, anyone here ever worked with a NLP project with a low resource language?

Whats a low resource language?

nimble acorn Jun 13, 2025, 2:15 AM

#

grand minnow Whats a low resource language?

limited online data, limited annotated data, POS tagged data for that language.

grand minnow Jun 13, 2025, 2:17 AM

#

nimble acorn limited online data, limited annotated data, POS tagged data for that language.

Like embedded?

grand minnow Jun 13, 2025, 2:18 AM

#

nimble acorn limited online data, limited annotated data, POS tagged data for that language.

Is that like micropython?

nimble acorn Jun 13, 2025, 2:19 AM

#

grand minnow Like embedded?

look at conllu file here. https://github.com/UniversalDependencies/UD_Amharic-ATT

fickle shale Jun 13, 2025, 5:05 AM

#

How to create good project for portfolio?

grand minnow Jun 13, 2025, 5:45 AM

#

fickle shale How to create good project for portfolio?

!kindling has a good list of projects to build for your portfolio. Another way is to actually sell your service to a client, then whatever you made for that client, feature it in your portfolio

arctic wedgeBOT Jun 13, 2025, 5:45 AM

#

Kindling Projects

The Kindling projects page contains a list of projects and ideas programmers can tackle to build their skills and knowledge.

peak field Jun 13, 2025, 6:25 AM

#

hey guys

#

Any datasets which i can either use transfer learning on or just a exisiting dataset i can train

#

for sleep deprivation levels

#

possibly using the Karolinska Sleepiness Scale

#

But doesnt require that

#

Thanks

austere swift Jun 13, 2025, 8:04 AM

#

peak field for sleep deprivation levels

A quick Kaggle search led me to this: https://www.kaggle.com/datasets/vitoraugustx/drowsiness-dataset, is that what you're looking for?

north jackal Jun 13, 2025, 10:14 AM

#

Does anyone know if theres a way to append an excel sheet with polars? I know I can do it with pandas, just polars.write_excel seems to overwrite the whole file

agile cobalt Jun 13, 2025, 12:06 PM

#

north jackal Does anyone know if theres a way to append an excel sheet with polars? I know I ...

the simplest way is probably just loading it, concatenating then rewriting the entire thing
not sure if you can get it working "better" by using position=

imo you almost definitely should use a different file format like parquet or csv, and if you really need of Excel convert from the other format to excel in the end after you're done appending

peak field Jun 13, 2025, 12:43 PM

#

austere swift A quick Kaggle search led me to this: https://www.kaggle.com/datasets/vitoraugus...

Hey

#

Sorry I should have been more clear

#

I wanted a dataset for sleep deprivation with faces

#

Or i heard that voices can also be accurate

#

I cant find too many results on that

peak field Jun 13, 2025, 12:46 PM

#

austere swift A quick Kaggle search led me to this: https://www.kaggle.com/datasets/vitoraugus...

thanks so much tho

raw owl Jun 13, 2025, 2:37 PM

#

Guys what programming languages do you need to know in coding if you want to be an ai research scientist

serene scaffold Jun 13, 2025, 2:54 PM

#

raw owl Guys what programming languages do you need to know in coding if you want to be ...

Python is the main one, but it's more important that you have a masters degree in computer science

azure wraith Jun 13, 2025, 3:22 PM

#

Thinking of getting a new Macbook Pro. Thoughts on M4 Pro vs M4 Max in 16" for data science/ai work locally?

#

Understood. I'm a newbie at this, so it likely wouldn't be anything too serious locally.

rich moth Jun 13, 2025, 3:28 PM

#

azure wraith Understood. I'm a newbie at this, so it likely wouldn't be anything too serious ...

If you're gonna go down that road. I'd do a little research first.

#

Waste of money IMO. You could build a serious AMD and Nvidia PC and still maintain 192 gigs of ram for way less.

azure wraith Jun 13, 2025, 3:31 PM

#

Let me clarify, I'm going to replace my current Macbook Pro anyway, so it's really "which one", not "whether" :) i.e. is the M4 Max worth the additional expense.

serene scaffold Jun 13, 2025, 3:31 PM

#

azure wraith Thinking of getting a new Macbook Pro. Thoughts on M4 Pro vs M4 Max in 16" for d...

I don't recommend buying a computer with local AI work in mind. No matter what computer you get, there will always be things you won't be able to do on it, which would require renting cloud compute.

late flume Jun 13, 2025, 3:31 PM

#

azure wraith Let me clarify, I'm going to replace my current Macbook Pro **anyway**, so it's ...

To what exactly?

iron basalt Jun 13, 2025, 3:32 PM

#

azure wraith Let me clarify, I'm going to replace my current Macbook Pro **anyway**, so it's ...

They are all pretty fast, just save some money.

rich moth Jun 13, 2025, 3:32 PM

#

I see, well my wife got a M3 a bit ago, 16gigs. Its really fast and efficient for what it is. I'm sure the M4's are powerhouses I'm just thinking compatibility issues down the road.

iron basalt Jun 13, 2025, 3:32 PM

#

Make sure you have enough RAM.

rich moth Jun 13, 2025, 3:33 PM

#

192 gigs of ram on an M4? i bet thats $$$

iron basalt Jun 13, 2025, 3:33 PM

#

No, just don't get the 8 GB.

azure wraith Jun 13, 2025, 3:33 PM

#

Let's focus people, it's just Pro vs Max :)

rich moth Jun 13, 2025, 3:33 PM

#

why would you get 8gb if you want todo ai research? you are crippling yourself from the gate

iron basalt Jun 13, 2025, 3:34 PM

#

rich moth why would you get 8gb if you want todo ai research? you are crippling yourself ...

Some people do this to save money, but due to massive software bloat, 8 GB is not much anymore.

rich moth Jun 13, 2025, 3:35 PM

#

azure wraith Let's focus people, it's just Pro vs Max :)

at 48 GIGS for either, I wont get either, but if I had to then the PRO.

iron basalt Jun 13, 2025, 3:38 PM

#

azure wraith Thinking of getting a new Macbook Pro. Thoughts on M4 Pro vs M4 Max in 16" for d...

The Max has a larger GPU, about double (in reality probably like 50% gain at most). But that is still a weak GPU for the type of things that GPUs are used for in AI, so just get a Pro. There is a large difference between regular and Pro.

#

Regular is clearly meant for actual laptop usage, low power.

azure wraith Jun 13, 2025, 4:04 PM

#

I should've looked further on Apple's site. If you want 64GB or 128GB, you're limited to the M4 Max. That made the choice easy enough :)

coarse valve Jun 13, 2025, 6:54 PM

#

I have a large parameter set of categorical information, none of which is ordinal and I've been tasked with clustering them into some semblance of segments. Too many unique values for one hot..so I am thinking about Binary encoding or embedding...I am confused on a strategy of embedding when I need to aggregate the data. Obviously there is just sum, mean, ecc...but does anyone have any experience with this? If you have what things were dont to get the most separation in your groupings?

waxen kindle Jun 13, 2025, 7:35 PM

#

There is something like PCA but for categorical data, I don't remember it's name but you can take a look to it

#

Like MCA maybe ?

lapis sequoia Jun 13, 2025, 9:01 PM

#

Hello
Can anyone suggest me step to develop ai Inc research

serene scaffold Jun 13, 2025, 9:01 PM

#

lapis sequoia Hello Can anyone suggest me step to develop ai Inc research

What is AI inc?

lapis sequoia Jun 13, 2025, 9:01 PM

#

including research steps

serene scaffold Jun 13, 2025, 9:13 PM

#

lapis sequoia Hello Can anyone suggest me step to develop ai Inc research

What do you want the model you develop to be able to do

lapis sequoia Jun 13, 2025, 9:18 PM

#

serene scaffold What do you want the model you develop to be able to do

chatbot

lapis sequoia Jun 13, 2025, 9:58 PM

#

serene scaffold What do you want the model you develop to be able to do

Can you teach me the steps to develop a chat bot ?

uneven ledge Jun 13, 2025, 10:13 PM

#

hi guys im new to data science and python altogether, do you guys have any recommendation on where to start or any tutorial I can follow?

waxen kindle Jun 13, 2025, 10:13 PM

#

!res

arctic wedgeBOT Jun 13, 2025, 10:13 PM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

uneven ledge Jun 13, 2025, 10:22 PM

#

alr thx man

serene scaffold Jun 13, 2025, 11:43 PM

#

I don't recommend that. Chatbots cost millions of dollars and many terabytes of data to develop

lapis sequoia Jun 14, 2025, 12:58 AM

#

serene scaffold I don't recommend that. Chatbots cost millions of dollars and many terabytes of ...

then what can i do for my final year project

serene scaffold Jun 14, 2025, 1:46 AM

#

lapis sequoia then what can i do for my final year project

if you're interested in language technology, something like a spam email detector would be more attainable

untold dove Jun 14, 2025, 4:36 AM

#

hey stelercus could i have you look at a model architecture i have created seems like u have a good understanding and maybe would be able to give some feedback

#

@serene scaffold

#

if not all good and thanks anyway

livid crown Jun 14, 2025, 5:16 AM

#

hello everyone, i currently thinking about a making a chatbot on local, I thought about using llama2 model. And chatbot have to generate text on my own language/not english/, and i am gonna use RAG technique to make llm understand a data. I am actually very new to AI field, and i thought about using a API translator to translate prompt to model and from model. Is it good idea or is there any better idea to run llm on my own language? sorry for my bad english.

lapis sequoia Jun 14, 2025, 7:16 AM

#

serene scaffold if you're interested in language technology, something like a spam email detecto...

Actually I thought the project I do for my final will help me to get into smtg big colleges for masters so I want to do such project brother

jaunty helm Jun 14, 2025, 8:44 AM

#

livid crown hello everyone, i currently thinking about a making a chatbot on local, I though...

llama 2 is very old, and there's almost 0 reason you'd use that rather than llama 3 (or any of the other newer models)
you don't need to "translate" models, they're almost certainly trained on text of various languages; just talk to it in your native language, chances are it'll talk in your language back

lean basalt Jun 14, 2025, 10:23 AM

#

hi lads! I have learnt basic of python and ai and want to sharpen my knowledge. In my area there is no direct source of info what do yall suggest I should do now

fallow coyote Jun 14, 2025, 12:30 PM

#

lean basalt hi lads! I have learnt basic of python and ai and want to sharpen my knowledge. ...

Learn a shit ton of maths. Genuinely, itll help you a lot i understanding the tool

livid crown Jun 14, 2025, 12:32 PM

#

jaunty helm 1. llama 2 is very old, and there's almost 0 reason you'd use that rather than l...

using llama3.2 and it wasnt trained on my language, if so should i use translator model?

tidal bough Jun 14, 2025, 12:38 PM

#

I'd actually be quite surprised if there's a language for which simultaneously:

there's enough data that machine translation is decent
there's not enough data for Llama3 to speak it

#

You could try a translation layer, but I don't think it'll go well

verbal oar Jun 14, 2025, 4:36 PM

#

for chatbot you make:
chat - websockets
bot - maybe naive bayes classifier (some model)

#

I made one recommendation bot but still didnt fix order of messages (didnt remove asyncio)

#

https://github.com/grzesiekmq/shopbot

GitHub

GitHub - grzesiekmq/shopbot: shopping bot in python with naive baye...

shopping bot in python with naive bayes and Market Basket Analysis - grzesiekmq/shopbot

fallow coyote Jun 14, 2025, 5:02 PM

#

Ive asked before but can someone recommend good beginner books for linear algebra? Ive tried reading gilbert strangs book but the way he says even the simplest of things is too confusing. Im using my high school (a level, futher pure maths) books to relearn matrices and vector. Id prefer to use books rather than wtch videos

wooden sail Jun 14, 2025, 5:58 PM

#

fallow coyote Ive asked before but can someone recommend good beginner books for linear algebr...

what about it troubes you?

#

gilbert strang's book is one of the simpler ones on linalg

#

maybe you want to look at intro to proof-writing material first to get familiar with how math is done outside of school

iron basalt Jun 14, 2025, 6:08 PM

#

fallow coyote Ive asked before but can someone recommend good beginner books for linear algebr...

Are you maybe looking for a more mechanical explanation, the kind of understanding of linear algebra a physics student would have (initially, eventually they go to Gilbert)?

lean basalt Jun 14, 2025, 8:43 PM

#

fallow coyote Learn a shit ton of maths. Genuinely, itll help you a lot i understanding the to...

will high school math be enough or more topics you suggest I should focus

waxen kindle Jun 14, 2025, 8:52 PM

#

You need linear algebra

#

Optimization

#

Partial derivatives

lean basalt Jun 14, 2025, 8:56 PM

#

waxen kindle Partial derivatives

ig they teach all this in high school atleast till A2 level

waxen kindle Jun 14, 2025, 8:58 PM

#

No idea, that things depends on the country

#

I learnt most of that in my first years of college

#

Some in high school

fallow coyote Jun 14, 2025, 9:54 PM

#

wooden sail gilbert strang's book is one of the simpler ones on linalg

I dont like he explains things. Its not the maths just how he explains things. I read sheldon axler book. That book I understood more but I cant go further as it feels like i need to brush up on my basic vector and matrix knowledge first to continue further

fallow coyote Jun 14, 2025, 9:55 PM

#

iron basalt Are you maybe looking for a more mechanical explanation, the kind of understandi...

Kinda. Tbf Id prefer a book that taught the theory and then applied it straight away

gentle storm Jun 15, 2025, 3:36 AM

#

Shakespear: ❌
Spearshake: ✅

tropic venture Jun 15, 2025, 3:29 PM

#

if there's a people who have dealt with astropy, is there a way to increase the precision of time conversions in astropy.time using external IERS or ERFA data?

past meteor Jun 15, 2025, 6:33 PM

#

Nice to have you back to hear about what you're building haha

#

What are you cooking up? (pun intended)

past meteor Jun 15, 2025, 6:55 PM

#

The tricky thing is that I feel like you'll have to prompt all the things you want to check

#

Hahaha this is actually cool

#

Are you using MCP or just good ol' API calls + structured output

#

It's a protocol that allows you to define how to interact with tools and resources in a unified way.

For instance, Github has made an MCP server, so you can give that to an MCP client and it will know how to execute actions on it automatically

#

Think OpenAPI but built for LLMs, they can browse it like humans would browse/interact with a swagger page

#

That, and more but let's say it's just that yeah

#

I prefer this as well

#

Regular code and control flow with the LLMs spitting out structured outputs

#

Which I guess is what pydantic AI also encourages, cause it's ... by pydantic

#

I always use the SDKs of the providers tho

#

Never bother with any abstractions on top of that

#

Me too, but OpenAI can spit out pydantic models

#

Same with Gemini models

#

Claude can't natively do it, but you can prefill + json repair and then cast it to a pydantic model and slap a couple of retries on top

#

Makes sense

#

I like using the raw stuff because I'm a cowboy 🤠

#

If say openai has a new thing that is a gamechanger and I read it, I can have it in dev by end of day and maybe prod by end of week

#

Depending on how big the thing is, the 3rd party wrapper may have to accomodate for it

noble moss Jun 16, 2025, 6:05 AM

#

Uh is someone willing to network im a data science guy

#

plz dm

unreal yoke Jun 16, 2025, 6:07 AM

#

hi

valid minnow Jun 16, 2025, 8:09 AM

#

I see bit of AI talk in here.
Anyone up for a networking or discussion on data engineering and AI adoption.

noble moss Jun 16, 2025, 8:57 AM

#

valid minnow I see bit of AI talk in here. Anyone up for a networking or discussion on data e...

Im willing to network

#

On data analysis and ML and shi

valid minnow Jun 16, 2025, 8:59 AM

#

noble moss On data analysis and ML and shi

Let's connect in Voice chat 1?

noble moss Jun 16, 2025, 9:05 AM

#

Im a bit busy rn

#

Dms?

weary timber Jun 16, 2025, 11:16 AM

#

noble moss Uh is someone willing to network im a data science guy

how does that work

#

you just network?

zinc patrol Jun 16, 2025, 2:11 PM

#

Lets goo! My AI bot knows which is which!

'Democracy is vital for global stability' -> USA
'Taiwan is an inalienable part of China' -> China
'We support NATO expansion' -> USA
'Sovereignty must be respected' -> China```

#

but wth it still has terrible accuracy

#

I think I should use leaky ReLU

#

can someone help me over here?

#

!paste

arctic wedgeBOT Jun 16, 2025, 2:14 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

zinc patrol Jun 16, 2025, 2:14 PM

#

https://paste.pythondiscord.com/XRZA

short moth Jun 16, 2025, 3:19 PM

#

Yo I am learning pandas and have a quick question.
I have data and I want to create a new data set with values of Low if the value is less than mean and High is it is greater than mean, this is what I did, it works but let me know If I can do something better

#

mask = label_num.lt(label_num.mean())
def low_high(val):
    if val in label_num[mask]:
        return "Low"
    return "High"
label_num.apply(low_high)

#

i know I can use the .where but the exercise requires me to use .apply

serene scaffold Jun 16, 2025, 3:19 PM

#

short moth Yo I am learning pandas and have a quick question. I have data and I want to cre...

sounds like you want to create a new column. not a dataset.

short moth Jun 16, 2025, 3:20 PM

#

yeah i dont know the vocab I am new

serene scaffold Jun 16, 2025, 3:20 PM

#

short moth i know I can use the .where but the exercise requires me to use .apply

what exercise are you doing?

short moth Jun 16, 2025, 3:20 PM

#

its from a book

serene scaffold Jun 16, 2025, 3:20 PM

#

you should avoid .apply as much as you ever possibly can.

#

what book?

short moth Jun 16, 2025, 3:20 PM

#

Effective Pandas Matt Harrison

#

yeahj its just part of the exericse, next one is to do the same but with .where

serene scaffold Jun 16, 2025, 3:21 PM

#

do you know how to make a new column in general?

short moth Jun 16, 2025, 3:21 PM

#

I have not learned dataframes so no

#

i am pretty young in my development in this

#

like I just started

serene scaffold Jun 16, 2025, 3:22 PM

#

if you "don't know dataframes", you might be too far ahead in the book

short moth Jun 16, 2025, 3:22 PM

#

the question is in chapter 9

#

page 80\

#

it says

#

Create a series from a numeric column that has the value of 'high' if it is equal to or abnove the mean and 'low'; if it below the mean using .apply

#

so maybe I did not explain the question right sorry about that

serene scaffold Jun 16, 2025, 3:23 PM

#

okay. a series is a stand-alone column

short moth Jun 16, 2025, 3:23 PM

#

yeah

#

this is my dataset:

0      127
1      127
2      127
3      127
4      127
      ... 
382    335
383    335
384    335
385    335
386    335
Name: LabelNumber, Length: 387, dtype: int64

#

I created this

serene scaffold Jun 16, 2025, 3:24 PM

#

so how do you get the mean value of that column as a float

short moth Jun 16, 2025, 3:24 PM

#

i just created a filter

serene scaffold Jun 16, 2025, 3:24 PM

#

a filter?

short moth Jun 16, 2025, 3:24 PM

#

mask = label_num.lt(label_num.mean())

serene scaffold Jun 16, 2025, 3:25 PM

#

you need the mean of the column as a float. not a filter or a mask.

short moth Jun 16, 2025, 3:25 PM

#

oh

#

so do I just do label_num.mean()?

serene scaffold Jun 16, 2025, 3:25 PM

#

right

#

and you need a function that takes one number (an integer or a float) and returns "Low" if the value is less than that number

#

otherwise "High"

short moth Jun 16, 2025, 3:26 PM

#

ah yes

#

I over engineered it

#

def low_high(val):
    if val < label_num.mean():
        return "Low"
    return "High"
label_num.apply(low_high)

#

now I shall try with the .where function which will work better!

#

pandas is really great I am glad I am learning this

#

thanks!

serene scaffold Jun 16, 2025, 3:27 PM

#

remember to only use .apply if you're sure there is no other way.

young granite Jun 16, 2025, 3:51 PM

#

guys,
is there a way to use seeding on the GPU rather than CPU?
Currently i need to recache my dataset to the GPU for every seed, there has to be a better way 😄

young granite Jun 16, 2025, 5:43 PM

#

not really,
id like to cache the whole dataset to the GPU once and then perform n-times a k-fold CV, but i think pytorch doesnt support something like this natively

agile cobalt Jun 16, 2025, 5:44 PM

#

you would need to either be working with an extraordinarily small dataset and/or an extraordinarily large GPU to fit the entire thing at once

young granite Jun 16, 2025, 5:46 PM

#

i mean currently its a relatively simple finetuning which easily fits inside my 24GB but yes for larger project this approach wouldnt be suitable, however i like to think outside the box for projects which technically allow such stupidity 😄

agile cobalt Jun 16, 2025, 5:47 PM

#

you can try just manually moving it to the GPU yourself instead of using whichever dataset utility you're currently using

young granite Jun 16, 2025, 5:49 PM

#

might have to check NVIDIAs SDK which seems to allow own pipelines

void stone Jun 16, 2025, 6:02 PM

#

I want to learn how neural networks work and I stumbled upon this article
It seemed interesting but I did not understand much : https://medium.com/@peres/resnet-vs-vgg16-which-cnn-architecture-performs-better-on-cifar-10-b6d6bb6e43c4
What you guys think about it and do you have any recommendations for me to learn about it ? I want to use it in some of my future projects

Medium

ResNet vs VGG16: Which CNN Architecture Performs Better on CIFAR-10?

A practical comparison of deep learning architectures using TensorFlow, CIFAR-10, and statistical analysis.

rich moth Jun 16, 2025, 6:46 PM

#

https://github.com/plunder707/UCF-Cosmic-Validation

GitHub

GitHub - plunder707/UCF-Cosmic-Validation: A scientific validation ...

A scientific validation demonstrating the ability of the Universal Complexity Framework (UCF) to extract fundamental physical constants from real astronomical data. - plunder707/UCF-Cosmic-Validation

short moth Jun 16, 2025, 8:11 PM

#

I have this question to solve:
With a dataset of your choice, set the index to monotonically increasing integers starting from 0 and convert these to the string version

#

I have this inital data set label_num_2 with the following data:

Displayed Landing Altitude    127
Displayed Landing Altitude    127
Displayed Landing Altitude    127
Displayed Landing Altitude    127
Displayed Landing Altitude    127
                             ... 
Right N2 Actual               335
Right N2 Actual               335
Right N2 Actual               335
Right N2 Actual               335
Right N2 Actual               335
Name: LabelNumber, Length: 387, dtype: int64

#

I say

label_num_2.reset_index(drop=True)

for the first part

#

I get this

0      127
1      127
2      127
3      127
4      127
      ... 
382    335
383    335
384    335
385    335
386    335
Name: LabelNumber, Length: 387, dtype: int64

#

now how do I convert the indexes to string

#

oh nvm I know

#

s2 = label_num_2.index.astype('str')
label_num_2.rename(s2)

#

wait it dont work but i am close

serene scaffold Jun 16, 2025, 8:19 PM

#

short moth ```python s2 = label_num_2.index.astype('str') label_num_2.rename(s2) ```

label_num_2.index.astype('str') returns a string index.
the name of a dataframe or series is a string.
so you can't do label_num_2.index.astype('str') like that

short moth Jun 16, 2025, 8:21 PM

#

serene scaffold `label_num_2.index.astype('str')` returns a string index. the name of a datafram...

so what do I use

serene scaffold Jun 16, 2025, 8:21 PM

#

short moth so what do I use

I think you can just overwrite an index with series.index = new_index

short moth Jun 16, 2025, 8:23 PM

#

ah ok

#

but I need to rename tho

#

the index

#

so have label_num_2.index = s2

#

and s2 is label_num.index.astype(str)

#

ok this works

#

s2 = label_num_2.index.astype('str')
label_num_2.index = s2

lavish wraith Jun 17, 2025, 3:47 AM

#

Is ploty and power bi is same tools for data analytics

serene scaffold Jun 17, 2025, 3:51 AM

#

lavish wraith Is ploty and power bi is same tools for data analytics

they are different things that have different purposes.

lavish wraith Jun 17, 2025, 4:01 AM

#

@serene scaffold I have just learned Basic Python, Numpy, Panda Matplotlib and seaborn ,could also need to learn power bi,sql,excel,ploty for data analytics or data science please guide me thanks i am very confused

#

Also i create project using python library i mentioned above

#

One more question how English important for job and interview ?? English is not my native language

fiery bane Jun 17, 2025, 4:15 AM

#

lavish wraith <@253696366952316929> I have just learned Basic Python, Numpy, Panda Matplotlib ...

where are you learning all of these from?

#

I suggest pick a free online course?

serene scaffold Jun 17, 2025, 4:17 AM

#

lavish wraith One more question how English important for job and interview ?? English is not ...

if you're applying to work in an English-speaking team, your ability to communicate in English is supremely important.

#

That goes for any language.

lapis sequoia Jun 17, 2025, 4:42 AM

#

serene scaffold if you're applying to work in an English-speaking team, your ability to communic...

Where is the data analysis room?

serene scaffold Jun 17, 2025, 4:42 AM

#

lapis sequoia Where is the data analysis room?

This is the one.

lapis sequoia Jun 17, 2025, 4:42 AM

#

serene scaffold This is the one.

Ok

lapis sequoia Jun 17, 2025, 4:43 AM

#

serene scaffold This is the one.

So do I need to install the entire office?

#

And what is the version of office should I use?

#

To Work with data analysis

serene scaffold Jun 17, 2025, 4:43 AM

#

I'm going to sleep, so don't ask your questions to me

lapis sequoia Jun 17, 2025, 4:44 AM

#

serene scaffold I'm going to sleep, so don't ask your questions to me

Ok Good morning

lavish wraith Jun 17, 2025, 5:13 AM

#

fiery bane I suggest pick a free online course?

Paid course udemy chris Breuhi

obsidian talon Jun 17, 2025, 7:12 AM

#

lapis sequoia And what is the version of office should I use?

Sorry.. office? Like Microsoft office?

#

For data analysis?

lapis sequoia Jun 17, 2025, 7:13 AM

#

obsidian talon Sorry.. office? Like Microsoft office?

Yeah, to use Excel

obsidian talon Jun 17, 2025, 7:13 AM

#

Why excel

lapis sequoia Jun 17, 2025, 7:13 AM

#

obsidian talon For data analysis?

Yep isn't Excel used in Data Analysis I think?

lapis sequoia Jun 17, 2025, 7:13 AM

#

obsidian talon Why excel

Excel or google sheets

obsidian talon Jun 17, 2025, 7:13 AM

#

I mean sure but like very basic kinds

lapis sequoia Jun 17, 2025, 7:14 AM

#

obsidian talon I mean sure but like very basic kinds

So what should I use

obsidian talon Jun 17, 2025, 7:15 AM

#

lapis sequoia So what should I use

I mean are you trying to learn data analysis or are you trying to do that specifically

lapis sequoia Jun 17, 2025, 7:15 AM

#

obsidian talon I mean are you trying to learn data analysis or are you trying to do that specif...

Wdym?

obsidian talon Jun 17, 2025, 7:16 AM

#

Why do you want to learn data analysis

lapis sequoia Jun 17, 2025, 7:16 AM

#

obsidian talon I mean are you trying to learn data analysis or are you trying to do that specif...

I want like to be a Data Analyst

lapis sequoia Jun 17, 2025, 7:16 AM

#

obsidian talon Why do you want to learn data analysis

To work with it

obsidian talon Jun 17, 2025, 7:16 AM

#

Gotcha

#

Have you done anything at all in the past?

lapis sequoia Jun 17, 2025, 7:16 AM

#

obsidian talon Have you done anything at all in the past?

No but iam learning python now

obsidian talon Jun 17, 2025, 7:17 AM

#

Id say just use google sheets for now because it's free and cloud based

lapis sequoia Jun 17, 2025, 7:17 AM

#

obsidian talon Id say just use google sheets for now because it's free and cloud based

How to download it

obsidian talon Jun 17, 2025, 7:17 AM

#

lapis sequoia No but iam learning python now

Python is probably the main language used for data analysis

lapis sequoia Jun 17, 2025, 7:17 AM

#

obsidian talon Python is probably the main language used for data analysis

Yeah I know

obsidian talon Jun 17, 2025, 7:17 AM

#

lapis sequoia How to download it

You dont need to download it. Just type in Google sheets in the search bar

lapis sequoia Jun 17, 2025, 7:18 AM

#

obsidian talon You dont need to download it. Just type in Google sheets in the search bar

My internet is so bad

obsidian talon Jun 17, 2025, 7:18 AM

#

Lol

lapis sequoia Jun 17, 2025, 7:18 AM

#

obsidian talon You dont need to download it. Just type in Google sheets in the search bar

So I want something like tocuse it offline

#

What if I download libreoffice

obsidian talon Jun 17, 2025, 7:18 AM

#

Idk then. You could try excel but its like 10 bucks a month

lapis sequoia Jun 17, 2025, 7:18 AM

#

obsidian talon Idk then. You could try excel but its like 10 bucks a month

Yeah, I know

obsidian talon Jun 17, 2025, 7:19 AM

#

lapis sequoia What if I download libreoffice

Never heard of it

lapis sequoia Jun 17, 2025, 7:19 AM

#

obsidian talon Never heard of it

Alright

verbal oar Jun 17, 2025, 9:20 AM

#

calc

odd stratus Jun 17, 2025, 9:59 AM

#

i did a bit of thinking on some ai stuff just now and i think the best step forward towards making agi would be similar to how we started making image generation

so the method would be to basically make a "universal driver" where we take two pieces of hardware and describe how the driver interface behaves, and then ask the ai to slowly train and learn to interface these two devices together
and then we slowly remove more of the given interface and ask it to fill in more and more until eventually we can give it any two pieces of hardware and it can automatically adapt to interface the components together, similar to how denoisers and image gen was trained by removing pixels and asking it to fill in the blanks

the extra addition is to then further extrapolate the ai and give it more abstract systems that arent hardare specific, maybe an api or even visual input, and eventually it can become a universal driver that can interface and connect any two concepts, which is crucial for the required adaptive processing needed for agi, and connecting one of the inputs or outputs of the driver to the current LLM architectures we use, maybe connecting multiple LLM for specific tasks, would allow for full adaptive input/output systems

past bramble Jun 17, 2025, 12:09 PM

#

are there any free online courses along with projects preferably that teach building LLMs from scratch?

dreamy smelt Jun 17, 2025, 12:49 PM

#

py_strong

orchid zephyr Jun 17, 2025, 4:21 PM

#

I'm taking a small course in college called introduction to data science basically covers basic topics I guess? for personal reasons I havnt been attending AT ALL and I'm really behind, I need to learn matplotlib pandas and seaborn, or at least the basics of them. will it take long to learn them?

calm thicket Jun 17, 2025, 6:31 PM

#

it will depend on how much python you already know, among other things. it's difficult to say

spring field Jun 17, 2025, 11:26 PM

#

yo, that's cool

proven pier Jun 18, 2025, 1:23 AM

#

Hey, I'm trying to do some simple DSP. For now, using a butterworth low pass filter on some data

#

Nevermind, I realized my problem. Smooth brain hour 😂

#

THe problem is just a bit weird because my sampling rate is so low. I'm not used to setting cutoff frequencies at such sub-hertz ranges

wide carbon Jun 18, 2025, 5:06 AM

#

cool

zinc patrol Jun 18, 2025, 8:19 AM

#

does anyone here know a good pytorch tutorial on youtube'

gaunt sorrel Jun 18, 2025, 9:11 AM

#

guys,what free resources or courses should i get so that i can learn complete machine learning from scratch?
do u guys know about any beginner friendly-courses which promote self-paced learning

unkempt yacht Jun 18, 2025, 11:58 AM

#

Hi is anyone up ?

rich river Jun 18, 2025, 1:21 PM

#

does same inputs and pth file mean the same outputs?
it is really weird I cannot reproduce the same results with other people although we're supposed to be using the same input images and pth file

broken stirrup Jun 18, 2025, 2:15 PM

#

#1384898272744701972 please help me

grand minnow Jun 18, 2025, 2:18 PM

#

gaunt sorrel guys,what free resources or courses should i get so that i can learn complete ma...

https://kaggle.com/learn

Learn Python, Data Viz, Pandas & More | Tutorials | Kaggle

Practical data skills you can apply immediately: that's what you'll learn in these no-cost courses. They're the fastest (and most fun) way to become a data scientist or improve your current skills.

spring field Jun 19, 2025, 12:30 AM

#

rich river does same inputs and `pth` file mean the same outputs? it is really weird I cann...

Not necessarily, some operations could be nondeterministic, though usually that shouldn't lead to any major differences

void sage Jun 19, 2025, 12:28 PM

#

Does anyone have any suggestions for clear documentation on an integrated architecture using the BERT-BiLSTM model?

serene scaffold Jun 19, 2025, 2:29 PM

#

void sage Does anyone have any suggestions for clear documentation on an integrated archit...

BERT's architecture is already bidirectional. I've never heard of anyone trying to combine it with a BiLSTM

void sage Jun 19, 2025, 2:34 PM

#

serene scaffold BERT's architecture is already bidirectional. I've never heard of anyone trying ...

well actually it's this DeBERTa model that I want to refer it. It's hard to find some journal that talks about this 2 intregated arch for example this is as far as I found https://www.sciencedirect.com/science/article/pii/S0010482524000052?ref=pdf_download&fr=RR-2&rr=952256165b1b6cf0, it does talk about DeBERTa-BiLSTM, but like I wonder if there's a code documentation about it, since I can't find it maybe I could use BERT-BiLSTM for a reference. what do you think?

weary timber Jun 19, 2025, 4:32 PM

#

zinc patrol https://paste.pythondiscord.com/XRZA

its normal that it doesnt have high accuracy without attention

#

make it calculate the amounts to calculate the cals

#

and you get a 20$ monthly subscription discord bot

#

👍🏿

#

how do you make it know what you have and what you dont have?

#

you just add and subtract when you eat and buy?

#

btw guys does anyone have a good project idea i can steal

#

use*

opaque falcon Jun 19, 2025, 4:50 PM

#

Some noob questions here:

Can anyone describe how much it costs to host an open source model on a provider?
what would be a good model and size to use for fine tuning it to a domain specific task like smart contracts in Rust or Solidity?
How do I curate the data set for such fine tuning?
Heuristics to know how much would fine tuning cost or things to consider while doing so?

opaque falcon Jun 19, 2025, 4:51 PM

#

gaunt sorrel guys,what free resources or courses should i get so that i can learn complete ma...

https://www.codecademy.com/catalog/language/python

Codecademy

Best Python Courses + Tutorials | Codecademy

Start your coding journey with Python courses and tutorials. From basic to advanced projects, grow your Python skills at Codecademy.

weary timber Jun 19, 2025, 7:58 PM

#

give me some ideas you have too much bro

#

😭😭

weary timber Jun 19, 2025, 9:50 PM

#

smart

#

maybe a bert mdsel could be finetuned on previoovusly penalized messages ??????

#

omG thats actually smarr

#

smart

small wedge Jun 20, 2025, 3:24 AM

#

!rule ad

arctic wedgeBOT Jun 20, 2025, 3:24 AM

#

Rules

6. Do not post unapproved advertising.

small wedge Jun 20, 2025, 3:25 AM

#

And the other one about paid work

vague bear Jun 20, 2025, 5:17 AM

#

is Andrew Ng's machine learning specialization course the same course on Coursera and deepleaning.ai ? They have the same name

#

ugg ignore my question. deeplearning one just redirects you to coursera

round mountain Jun 20, 2025, 5:22 AM

#

hello

#

!rule ad

arctic wedgeBOT Jun 20, 2025, 5:23 AM

#

Rules

6. Do not post unapproved advertising.

stoic raven Jun 20, 2025, 8:07 AM

#

hello, can anyone give me the pros and cons for using dhtmlx and apache superset for pm4py? im doin a little crowdsourcing since ive read that both are great

lapis sequoia Jun 20, 2025, 2:09 PM

#

where do i get started if i want to make face detection

lapis sequoia Jun 20, 2025, 3:17 PM

#

What are the most 3 domains in Machine Learning that are high in demand in 2025 and more likely to stay high in demand in the future?

serene scaffold Jun 20, 2025, 3:21 PM

#

lapis sequoia What are the most 3 domains in Machine Learning that are high in demand in 2025 ...

generative AI is the current thing.

lapis sequoia Jun 20, 2025, 3:21 PM

#

serene scaffold generative AI is the current thing.

okay thanks

lapis sequoia Jun 20, 2025, 3:28 PM

#

serene scaffold generative AI is the current thing.

If I may ask for deeply, so more like what domain exactly? CV, NLP or ?

#

because Generative AI it's not a domain itself

#

it's built on these

#

and other domains

serene scaffold Jun 20, 2025, 3:28 PM

#

lapis sequoia because Generative AI it's not a domain itself

generative AI is a domain with subdomains.

#

I'm in a meeting right now

lapis sequoia Jun 20, 2025, 3:29 PM

#

okay no worries take your time

#

but I'll do more googling

#

I don't know what I see on google including IBM that it relies on ML Models like deep learning which I think would learn to NLP, CV and so on to make it do wihat it does

#

maybe we speak on two different things or Idk

#

or there's a misunderstanding of my main question

serene scaffold Jun 20, 2025, 3:57 PM

#

lapis sequoia I don't know what I see on google including IBM that it relies on ML Models like...

"deep learning" is just "having a neural network with lots of layers". it doesn't tell you anything about what kind of model it is.

Right now it's popular to find different applications for generative language models.

lapis sequoia Jun 20, 2025, 4:18 PM

#

Ok

verbal oar Jun 20, 2025, 5:14 PM

#

deep learning isnt model its subset of ml

final anvil Jun 20, 2025, 5:14 PM

#

hey so im learning about gradient descent and linear regression and i was watching this video (https://youtu.be/sDv4f4s2SB8?t=1306) and he says if you were to have more parameters you would just "take more derivatives"

so i was wondering would that look like in an actual example? i implemented my own gradient descent thing and currently i have these two lines

        m_grad += -2 * x * err
        b_grad += -2 * err ``` 
so would i just have like an additional m_grad or smth for each new parameter?

YouTube

StatQuest with Josh Starmer

Gradient Descent, Step-by-Step

Gradient Descent is the workhorse behind most of Machine Learning. When you fit a machine learning method to a training dataset, you're probably using Gradient Descent. It can optimize parameters in a wide variety of settings. Since it's so fundamental to Machine Learning, I decided to make a "step-by-step" video that shows you exactly how it wo...

▶ Play video

verbal oar Jun 20, 2025, 5:16 PM

#

gradient is n vector of partial derivatives

white coral Jun 20, 2025, 5:26 PM

#

I'm currently open to paid opportunities related to Generative AI and Agentic AI. If you have any projects or work I can contribute to, feel free to DM me!

gritty vessel Jun 20, 2025, 5:39 PM

#

Okie we can say it at each timestamp we have around 5-6 3000 x 3000 arrays as feature and we have 1 3000x3000 array as target

#

Sorry got real busy with some family problems

dry raft Jun 21, 2025, 3:20 AM

#

So if I were to like generate embeddings from a PDB file to be fed to an LLM, can I just (yeah this is going to be oversimplfied here) run the file through the GNN and put a Linear layer at the end to generate embeddings?

jaunty helm Jun 21, 2025, 3:25 AM

#

gritty vessel Okie we can say it at each timestamp we have around 5-6 3000 x 3000 arrays as fe...

I'm starting to think there's some reason that you can't go into detail on what your features are cause you keep dancing around it
so without knowing more, nns work ig

agile cobalt Jun 21, 2025, 3:31 AM

#

dry raft So if I were to like generate embeddings from a PDB file to be fed to an LLM, ca...

remember that you must train the model for its embeddings to be useful

if you can find some existing model (for nearly any given purpose) that supports that sort of input, you could try just using an intermediate representation of that model as the embeddings instead of training one yourself

stuck tapir Jun 21, 2025, 3:32 AM

#

dry raft So if I were to like generate embeddings from a PDB file to be fed to an LLM, ca...

yes that works run pdb through gnn encode structure then linear layer to get fixed-size embedding,

optionally concat with sequence model embedding like esm2 if you want both structure and sequence context then feed to llm

gritty vessel Jun 21, 2025, 5:29 AM

#

jaunty helm I'm starting to think there's some reason that you can't go into detail on what ...

I definitely don't go around what my features are .As I said images(which are basically arrays) are 3 x (m x n) rgb channel here I have single channel of m x n. No worries Thank you

lavish wraith Jun 21, 2025, 1:48 PM

#

I have learnt make chart in matplotlib and seaborn now i learn make chart in ploty every libraries different have approach to make chart how do i memories every chart approach ??

#

Is type of question can they ask in interview ??

serene scaffold Jun 21, 2025, 2:40 PM

#

lavish wraith I have learnt make chart in matplotlib and seaborn now i learn make chart in plo...

You don't have to know every plotting library

opaque condor Jun 21, 2025, 4:04 PM

#

Does everyone have an Nvidia gpu

tidal bough Jun 21, 2025, 4:07 PM

#

well, probably some have an amd gpu instead?? or intel, or integrated graphics.

serene scaffold Jun 21, 2025, 4:11 PM

#

opaque condor Does everyone have an Nvidia gpu

all the enterprise AI/ML is being computed on nvidia hardware, yes.

opaque condor Jun 21, 2025, 4:13 PM

#

Because I have a Intel and I want to know if I can change the GPU for an Nvidia one because they're more from numerical machine learning purposes and wanted to know if anyone had any so that I can confirm my suspicion

serene scaffold Jun 21, 2025, 4:14 PM

#

opaque condor Because I have a Intel and I want to know if I can change the GPU for an Nvidia ...

no matter what consumer GPU you get, there will always be things you can't do because it's too small. I wouldn't even buy a GPU for the purpose of doing ML--I would just rent cloud compute

tidal bough Jun 21, 2025, 4:18 PM

#

nvidia does have the best ML support, with CUDA. amd's ROCM is sort of alright too but it seems to have far shorter-term support, with slightly old GPUs being out of luck. and I know nothing about ML on Intel GPUs; they probably have... something?

jaunty helm Jun 21, 2025, 4:39 PM

#

non-nvidia are trying really hard to catch up, but yeah nothing beats cuda atm
at worst you can run pytorch vulkan

#

and the intel specific solution is here ig

GitHub

GitHub - intel/intel-extension-for-pytorch: A Python package for ex...

A Python package for extending the official PyTorch that can easily obtain performance on Intel platform - intel/intel-extension-for-pytorch

dry raft Jun 21, 2025, 5:17 PM

#

agile cobalt remember that you must train the model for its embeddings to be useful if you c...

Thx! 🤩👍

dry raft Jun 21, 2025, 5:17 PM

#

stuck tapir yes that works run pdb through gnn encode structure then linear layer to get fix...

Thanks for the help ☺️

dry raft Jun 21, 2025, 7:20 PM

#

stuck tapir yes that works run pdb through gnn encode structure then linear layer to get fix...

btw do people usually do this for multimodal LLM projects in general?

dry raft Jun 21, 2025, 8:29 PM

#

agile cobalt remember that you must train the model for its embeddings to be useful if you c...

alright, and also, how would I indicate to the LLM that the embedding that I am feeding it is from an image or a graph representation of a protein/molecule? (tbf it can be an embedding from a data source that is not text)

#

and if the question is flawed, just tell me, as long as I can learn, then that is what will matter ducky_party

opaque falcon Jun 21, 2025, 8:42 PM

#

Does anyone know what are some good servers to learn how to start an AI based business? A balance of technical and business help?

serene scaffold Jun 21, 2025, 8:43 PM

#

opaque falcon Does anyone know what are some good servers to learn how to start an AI based bu...

What skills do you have that would inspire confidence in a venture capitalist?

agile cobalt Jun 22, 2025, 2:16 AM

#

dry raft alright, and also, how would I indicate to the LLM that the embedding that I am ...

you would need to create a dataset and train/fine tune it in a way such that the model weights must learn to identify and extract relevant information from the embedding

you may want to take a look at how some people fine tuned text-only models to work with images as an example

dry raft Jun 22, 2025, 2:37 AM

#

agile cobalt you would need to create a dataset and train/fine tune it in a way such that the...

ah so like minigpt? yeah, I can do that.

high wadi Jun 22, 2025, 6:38 AM

#

anyone who has any idea on why this is?

lavish wraith Jun 22, 2025, 3:30 PM

#

serene scaffold You don't have to know every plotting library

Ok

serene scaffold Jun 22, 2025, 3:30 PM

#

high wadi anyone who has any idea on why this is?

did you click the "for further information" link?

high wadi Jun 22, 2025, 3:31 PM

#

i just updated the packages and it worked

serene scaffold Jun 22, 2025, 3:31 PM

#

high wadi i just updated the packages and it worked

okaay, next time remember to please give error messages as text. not screenshots.

high wadi Jun 22, 2025, 3:31 PM

#

sure thanks

lavish wraith Jun 22, 2025, 3:34 PM

#

Could i need to read this book python & algorithms in python michael t goodrich for python proficiency and data science and for interview ??

serene scaffold Jun 22, 2025, 3:34 PM

#

lavish wraith Could i need to read this book python & algorithms in python michael t goodri...

just reading a book won't land you an interview

lavish wraith Jun 22, 2025, 3:37 PM

#

serene scaffold just reading a book won't land you an interview

I just learned Basic Python and simple data structure list,set,dict,tuples and little bit practice no oop and no advanced data structure i am fear in my mind if they ask questions on these topic,please guide me

serene scaffold Jun 22, 2025, 3:38 PM

#

lavish wraith I just learned Basic Python and simple data structure list,set,dict,tuples and l...

to be clear: if you don't have a formal education, you almost certainly will not be selected for an interview.

#

It can still be worthwhile to learn python and data science, though.

lavish wraith Jun 22, 2025, 3:40 PM

#

serene scaffold to be clear: if you don't have a formal education, you almost certainly will not...

online certificate don't help ?? I am taking paid course 13 hour long course i just 13 hour course in two months

verbal oar Jun 22, 2025, 3:40 PM

#

why 2 different avatars I'm confused

serene scaffold Jun 22, 2025, 3:40 PM

#

lavish wraith online certificate don't help ?? I am taking paid course 13 hour long course i ...

do you have a formal education in some other area?

serene scaffold Jun 22, 2025, 3:41 PM

#

verbal oar why 2 different avatars I'm confused

I'm not sure what you're referring to.

verbal oar Jun 22, 2025, 3:41 PM

#

serene scaffold Jun 22, 2025, 3:41 PM

#

verbal oar

must be a glitch.

verbal oar Jun 22, 2025, 3:41 PM

#

ah ok nvm

lavish wraith Jun 22, 2025, 3:43 PM

#

serene scaffold do you have a formal education in some other area?

I have bachelor degree but it have five year gap i have waste five year of my life

serene scaffold Jun 22, 2025, 3:43 PM

#

lavish wraith I have bachelor degree but it have five year gap i have waste five year of my l...

bachelor degree in what?

#

and where do you live?

lavish wraith Jun 22, 2025, 3:44 PM

#

serene scaffold bachelor degree in what?

Software engineering in Pakistan

serene scaffold Jun 22, 2025, 3:46 PM

#

lavish wraith Software engineering in Pakistan

I recommend you try to stay in software engineering and not try to pivot to data science.

lavish wraith Jun 22, 2025, 3:47 PM

#

serene scaffold I recommend you try to stay in software engineering and not try to pivot to data...

I already take 4-5 months in this field in data science

serene scaffold Jun 22, 2025, 3:48 PM

#

lavish wraith I already take 4-5 months in this field in data science

I do not think that will be sufficient.

#

maybe it's different in Pakistan.

lavish wraith Jun 22, 2025, 3:52 PM

#

serene scaffold maybe it's different in Pakistan.

I am not understood what do you mean

lavish wraith Jun 22, 2025, 4:08 PM

#

Please reply

serene scaffold Jun 22, 2025, 4:18 PM

#

lavish wraith I am not understood what do you mean

in north america and europe, under current conditions, doing a data science certificate probably wouldn't be enough to do a career change into data science.

#

so, I recommend focusing on software engineering. if you already have a degree in thta, you're more likely to be successful there.

lavish wraith Jun 22, 2025, 4:41 PM

#

serene scaffold so, I recommend focusing on software engineering. if you already have a degree i...

Ok

stone coral Jun 22, 2025, 5:06 PM

#

Hey so im working on a Model XGBOOST Classification for sports prediction. My question is I have game logs of a player from all his previous games... But for the up coming game is there a way I can feed the model like a bias for example if hes happy then 1 if neutral or sad then 0. But thing is its only implemented for the upcoming game so all his previous game logs dont contain it. My problem is that the model will have no clue what to do with this data because its only 1 but I guess im not sure how to implement it.

#

The only way I can think about it is adding it as a multiplier onto the models final prediction. But that just feels very trial error based because in reality dont know how much it really affects it but again adding it 1 time to the model.... the model itself wouldnt know how it would affect it I guess

jaunty helm Jun 22, 2025, 5:21 PM

#

stone coral Hey so im working on a Model XGBOOST Classification for sports prediction. My qu...

no

jaunty helm Jun 22, 2025, 5:21 PM

#

stone coral The only way I can think about it is adding it as a multiplier onto the models f...

exactly - you don't know how him being happy or not would affect the outcome cause you don't have any observations of its effects
so how is the model supposed to know

stone coral Jun 22, 2025, 5:24 PM

#

jaunty helm exactly - you don't know how him being happy or not would affect the outcome cau...

Gotcha, so its just a deadend unless I manage to implement it from all previous games...

past meteor Jun 22, 2025, 5:28 PM

#

stone coral Gotcha, so its just a deadend unless I manage to implement it from all previous ...

Correct, no having access to a feature is a common problem

#

It's basically missing data

stone coral Jun 22, 2025, 5:28 PM

#

past meteor Correct, no having access to a feature is a common problem

Right, yea I was legit trying to figure out if just theres a way around it but really there isn't... Just learning as I go lol.

past meteor Jun 22, 2025, 5:30 PM

#

stone coral Right, yea I was legit trying to figure out if just theres a way around it but r...

I guess you could impute it but I'm not a fan, I feel like you'll introduce more noise than you'll get value out of using that feature 🙂

stone coral Jun 22, 2025, 5:32 PM

#

past meteor I guess you could impute it but I'm not a fan, I feel like you'll introduce more...

Ye. If anything I might just do a NN to detect the players mood based off how the reporters described it before game since they do have a report on each game (Text Identification). And then try to implement it into each game log since its tabular and it would work.. Just more work

past meteor Jun 22, 2025, 5:33 PM

#

Yeah, to maybe improve your accuracy by 3%

#

not worth it

stone coral Jun 22, 2025, 5:33 PM

#

Not worth it but its my first project i kinda comitted too learning for ML.

#

So Ima just do it for the fun of it

stone coral Jun 22, 2025, 5:34 PM

#

past meteor Yeah, to maybe improve your accuracy by 3%

But have to admit that is alot considering sports is very unpredictable

#

I'd be happy if I even reach 80% lol

past meteor Jun 22, 2025, 5:35 PM

#

If you're going to do anything, learn about heuristics and evaluation tbh

#

That's where any good ML project starts

stone coral Jun 22, 2025, 5:35 PM

#

Will look into it

past meteor Jun 22, 2025, 5:36 PM

#

What sport is it?

stone coral Jun 22, 2025, 5:36 PM

#

NHL. I enjoy watching it alot

#

Just wanted to do some automation prediction for me instead of me manually always looking back into previous games

past meteor Jun 22, 2025, 5:36 PM

#

For instance, as a massive football (soccer) fan you notice an easy trend, teams that are winning have a high prob to win the next game

#

Form is the key attribute

#

So the first set of models you build should probably be:

winner = the team that did best in the last game.

Then you compute the accuracy on the basis of that.

Then you can add more than 1 game, the last N matches are predictive.

Per game you can add more variables (in football xG (excepted goals), xGA (expected goals against) are also a big factor. A rolling average of the xG differential is a good metric. The ELO of the teams also matters, as well as the odds betting sites are giving for the game (which likely takes into account all of the above)

stone coral Jun 22, 2025, 5:41 PM

#

gotcha. Ye I currently have 17 features for my model. Which I got an accuracy of highest 76% for predicting goals, shots on goal and assists. Most of the time the bad players have 0 goals which the model understands and the good ones score occasionally but not always so its a 50/50 flip depending on the team its up against...

#

And those contribute to a different model for which team will win

past meteor Jun 22, 2025, 5:42 PM

#

I'm saying all of this because at the end of the day, in ML it's super important to express model performance versus your baselines btw. Maybe the baseline of "the team with the better last N matches" which is an if-then statement with no ML already has 73 % 😄

stone coral Jun 22, 2025, 5:43 PM

#

Ye

#

I kinda noticed that actually in my model

#

If I train on the previous 10 matches it does alot better than the last overall matches they every played.

#

But thanks alot though!

winter canyon Jun 22, 2025, 6:15 PM

#

guys i learned basics of python now am trying to learn ai and ml
someone recommended me to learn these:
1.Projects
2.math(statistics)
3.numpy
4.ML framework pytorch,tensorflow,polars
and reffered me to this channel
anyone has any advice and sources to learn

past meteor Jun 22, 2025, 6:16 PM

#

winter canyon guys i learned basics of python now am trying to learn ai and ml someone recomme...

Check the second pinned post

#

Not trying to be dismissive but there's some good stuff in there 🙂

winter canyon Jun 22, 2025, 6:18 PM

#

ohh ok tyq also i guess i need to learn sql instead of pandas

#

i will check the pins thanks for telling

#

ooo i learned py also from book this is good

sick raven Jun 22, 2025, 6:28 PM

#

Hey guys, I'm trying to get into the data analytics world but I don't have any idea in programming. Is it really a good idea to start in python?

winter canyon Jun 22, 2025, 6:29 PM

#

sick raven Hey guys, I'm trying to get into the data analytics world but I don't have any i...

yea pythons good

past meteor Jun 22, 2025, 6:30 PM

#

sick raven Hey guys, I'm trying to get into the data analytics world but I don't have any i...

SQL

#

Many people in analytics just use SQL and the Python they write is very basic anyway

sick raven Jun 22, 2025, 6:30 PM

#

Is SQL harder than Python? I get the gist of Python and it's pretty hard but I really want to learn.

winter canyon Jun 22, 2025, 6:30 PM

#

use this book called bytes of python(tho the syntax is a bit outdated) its a good beginner friendly book for python

past meteor Jun 22, 2025, 6:30 PM

#

SQL is a lot easier than Python

#

You'll notice immediately that SQL is declarative. You just say select name, age from people and it'll grab that from the right table

sick raven Jun 22, 2025, 6:32 PM

#

I got this app called Mimo on my phone and I'm using it to learn the basics. Is it good or do you have something you can recommend that's more efficient?

winter canyon Jun 22, 2025, 6:32 PM

#

yo zestar does sql have uses in ai ml

past meteor Jun 22, 2025, 6:32 PM

#

Python is imperative. With pure Python, so not using pandas or similar, you need to loop and do all sorts of stuff to get ta similar result

past meteor Jun 22, 2025, 6:33 PM

#

winter canyon yo zestar does sql have uses in ai ml

... in a way yes 😄

At my job most of the data transformation is done in SQL. Once the data is clean enough people then use Python on top of the clean data to make the models

winter canyon Jun 22, 2025, 6:34 PM

#

ooo so we can kinda skip the pandas i guess

past meteor Jun 22, 2025, 6:34 PM

#

You still need Pandas, but if you know SQL learning Pandas is a bit easier

winter canyon Jun 22, 2025, 6:34 PM

#

ohh ok

#

also is there like a minimum no.of built in modules i need to learn like i only know a few like math panda sys and also in that i only know math

past meteor Jun 22, 2025, 6:37 PM

#

winter canyon also is there like a minimum no.of built in modules i need to learn like i only ...

No, and it's honestly not helpful to think about it that way 🙂

winter canyon Jun 22, 2025, 6:37 PM

#

ohh

past meteor Jun 22, 2025, 6:38 PM

#

Learning how to code well takes many years. The good thing is that we basic knowledge you can already build cool stuff

#

So I'd encourage you to take it one step at a time

#

And not look at it from the perspective of "ticking boxes" like you need to learn these 15 topics

winter canyon Jun 22, 2025, 6:39 PM

#

hmm so i need to practice i guess

#

thanks man

sick raven Jun 22, 2025, 6:40 PM

#

How long did it take you to be good? And can I really learn this without any knowledge?

past meteor Jun 22, 2025, 6:41 PM

#

Hard to say how long itt ook and I also think it's not comparable because with GPT/Claude if you use them well you can learn 5x as fast as it took me (but also 5x as slow if you use them poorly)

past meteor Jun 22, 2025, 6:42 PM

#

sick raven How long did it take you to be good? And can I really learn this without any kno...

But as for learning "without any knowledge", definitely possible 😄 I didn't study CS. In my degree I had only had a couple of classes dedicated to learning how to code

#

I had a fair share of classes that I needed to code in, but they all assumed you already knew how to code or you could teach yourself

sick raven Jun 22, 2025, 6:44 PM

#

Thanks, I really want to learn. Just watching videos on youtube right now makes it seem so complicated but I'm really serious about this.

past meteor Jun 22, 2025, 6:45 PM

#

sick raven Thanks, I really want to learn. Just watching videos on youtube right now makes ...

Go with books, don't watch youtube videos

sick raven Jun 22, 2025, 6:46 PM

#

Just downloaded the byte of python book.

past meteor Jun 22, 2025, 6:47 PM

#

Do all of the exercises. Whenever they show a piece of code and an example, type it over and run it in your terminal as well

#

very important 🙂

sick raven Jun 22, 2025, 6:51 PM

#

I really appreciate all the help, bro.

ornate trellis Jun 22, 2025, 7:59 PM

#

hello I am second year B.TECH CS student wanted to make my career in Data engineer/analytics/scientist role can anyone help me to find correct path

#

currently Learning python

serene scaffold Jun 22, 2025, 8:02 PM

#

ornate trellis hello I am second year B.TECH CS student wanted to make my career in Data engine...

will you have the opportunity to specialize as part of your BTech?

ornate trellis Jun 22, 2025, 8:04 PM

#

I have specialization in Ai/Ml with data analytics

verbal oar Jun 22, 2025, 8:20 PM

#

I think quering data frame from csv is like quering database with sql

opaque falcon Jun 22, 2025, 8:22 PM

#

winter canyon guys i learned basics of python now am trying to learn ai and ml someone recomme...

I am actually trying to developer a subscription series to teach people how to do this, more in-depth an codecademy. For now you can check out Codecademy though. They have good stuff.

#

Question, would you prefer the lessons in jupyter notebooks or obsidian?

serene scaffold Jun 22, 2025, 8:38 PM

#

opaque falcon Question, would you prefer the lessons in jupyter notebooks or obsidian?

idk what obsidian is. I think beginners should start with regular scripts (not notebooks)

opaque falcon Jun 22, 2025, 8:39 PM

#

serene scaffold idk what obsidian is. I think beginners should start with regular scripts (not n...

Obsidian is a a knowledge management software. Think of it like a private wikipedia of interconnected notes:
https://obsidian.md/

#

I am going to create a zero to hero course on learning machine learning and AI. Ideally charging from $30 to $100 a month. Depending on the demographic.

verbal oar Jun 22, 2025, 8:44 PM

#

interactive?

opaque falcon Jun 22, 2025, 8:46 PM

#

verbal oar interactive?

Yup. Most of the lessons will have a sample Juypter Notebook that outputs code as well.

opaque falcon Jun 22, 2025, 8:46 PM

#

verbal oar interactive?

What else would you like?

#

I am aiming to spend 1 year and several hours a day doing this full time.

opaque falcon Jun 22, 2025, 8:48 PM

#

verbal oar interactive?

What terms or ideas are you unfamiliar wiht?

verbal oar Jun 22, 2025, 8:50 PM

#

sth like datacamp

verbal oar Jun 22, 2025, 8:51 PM

#

opaque falcon What else would you like?

.

opaque falcon Jun 22, 2025, 8:53 PM

#

verbal oar sth like datacamp

DataCamp looks cool. I will check them out and subscribe with them to create lessons based on their content as well. Great suggestion.

weak oxide Jun 22, 2025, 9:02 PM

#

serene scaffold idk what obsidian is. I think beginners should start with regular scripts (not n...

Tbh I learned through Jupyter Notebook and with the auto complete feature, it's lovely for small projects

serene scaffold Jun 22, 2025, 9:02 PM

#

weak oxide Tbh I learned through Jupyter Notebook and with the auto complete feature, it's ...

do you know what the limitations/issues with notebooks are?

weak oxide Jun 22, 2025, 9:03 PM

#

serene scaffold do you know what the limitations/issues with notebooks are?

Of course, it's painful for large projects as I found out

serene scaffold Jun 22, 2025, 9:03 PM

#

some people don't and think that notebooks are the main way to write code.

weak oxide Jun 22, 2025, 9:03 PM

#

Refreshing the entire notebook is an issue too and I have 64 GB of RAM

verbal oar Jun 22, 2025, 9:05 PM

#

notebooks are good for tutorial projects?

weak oxide Jun 22, 2025, 9:05 PM

#

It's good to learn about packages like Darts

#

Very readable what's happening

#

You can also because I had studied for my CFA, was able to code in the Notebook weird tactic but it's interesting

verbal oar Jun 22, 2025, 9:07 PM

#

hmm notebook inside vscode is still notebook?

serene scaffold Jun 22, 2025, 9:09 PM

#

yes

#

the fact that it has executable cells that show intermediary output is what makes it a notebook. not the jupyter browser UI.

weak oxide Jun 22, 2025, 9:12 PM

#

I just use Pycharm now though I hear from friends that companies have their own special platform

opaque falcon Jun 22, 2025, 9:12 PM

#

anyone wants to join the server to discuss building the course?

weak oxide Jun 22, 2025, 9:13 PM

#

It's more comprehensive than Jupyter tbh

#

But who has used Spyder?

#

It looks so awkward

opaque falcon Jun 22, 2025, 9:15 PM

#

weak oxide But who has used Spyder?

I tried it. Wasn't my cup of tea.

weak oxide Jun 22, 2025, 9:15 PM

#

Me neither

opaque falcon Jun 22, 2025, 9:15 PM

#

verbal oar hmm notebook inside vscode is still notebook?

https://course.fast.ai/Resources/book.html
Check out how Google Colab runs Juypter Notebooks

Practical Deep Learning for Coders

Practical Deep Learning for Coders - The book

Learn Deep Learning with fastai and PyTorch, 2022

#

If there was a table of contents for learning about AI and machine learning from zero to hero, how would it look like? What topics would be included?

#

Any suggestions @weak oxide @serene scaffold

#

What would you like to see? @verbal oar

weak oxide Jun 22, 2025, 9:18 PM

#

This looks interesting

opaque falcon Jun 22, 2025, 9:18 PM

#

weak oxide This looks interesting

Yea, looks cool right?

weak oxide Jun 22, 2025, 9:18 PM

#

Are you using Pytorch Forecasting or Darts

#

For the time series section

#

Wait I don't see a time series part

#

That's actually important because I struggled with that at first

#

From personal experience anyway

#

This looks like a really general overview

opaque falcon Jun 22, 2025, 9:21 PM

#

Ya, the book is from Fast AI. I am planning something more comprehensive

#

Aiming to dedicate 1 year and full time to writing it.

opaque falcon Jun 22, 2025, 9:21 PM

#

weak oxide Are you using Pytorch Forecasting or Darts

to keep it simple probably PyTorch

weak oxide Jun 22, 2025, 9:23 PM

#

opaque falcon to keep it simple probably PyTorch

Yeah but this is really cool
https://unit8co.github.io/darts/
Right now I'm trying to get Darts creators because I'm in contact to add more models.

#

It has NBEATs, NHITs, LSTMs, GRU, RNN, Transformers it's really nice

opaque falcon Jun 22, 2025, 9:25 PM

#

weak oxide Yeah but this is really cool https://unit8co.github.io/darts/ Right now I'm tryi...

looks pretty cool.

weak oxide Jun 22, 2025, 9:27 PM

#

So you used the MNIST dataset

#

Or FastAI did

opaque falcon Jun 22, 2025, 9:30 PM

#

weak oxide Or FastAI did

yea, pretty common for image recongition

#

My goal is to create the easiest AI/machine learning course out there. I want to write it in simple English and not assume much technical background.

weak oxide Jun 22, 2025, 9:32 PM

#

Tbh the way I learned was taking a bunch of FRED data and trying to "predict" (not how to do it in reality) using a LSTM model. It may have not been the best way but indirectly between me and you the forecasts weren't that far off. Maybe by pure luck

opaque falcon Jun 22, 2025, 9:32 PM

#

I want to reduce the incline in learning these topics so that anyone with just a python background and pick this up. Even explaining the math will be in Python and simple English.

weak oxide Jun 22, 2025, 9:33 PM

#

For image recognition and CNNs yeah MNIST is fine

opaque falcon Jun 22, 2025, 9:33 PM

#

weak oxide Tbh the way I learned was taking a bunch of FRED data and trying to "predict" (n...

Economic data interests me! I have a background in Econ and worked in web3 for a while.

weak oxide Jun 22, 2025, 9:34 PM

#

opaque falcon Economic data interests me! I have a background in Econ and worked in web3 for a...

Cool I'm finance

#

But FRED it was pretty easy for time series forecasting

opaque falcon Jun 22, 2025, 9:35 PM

#

Thats awesome. What type of finance?

weak oxide Jun 22, 2025, 9:36 PM

#

Quantitative

opaque falcon Jun 22, 2025, 9:43 PM

#

weak oxide Quantitative

Thats pretty cool. Which asset class? Stocks, bonds, derivatives?

weak oxide Jun 22, 2025, 9:48 PM

#

opaque falcon Thats pretty cool. Which asset class? Stocks, bonds, derivatives?

Derivatives

#

Very tricky but fun

opaque falcon Jun 22, 2025, 9:49 PM

#

weak oxide Derivatives

That's pretty cool. Reminds of me options and Nassem Taleb, his books.

#

Making an oil bets with the issues in the Miiddle East, lol

weak oxide Jun 22, 2025, 9:51 PM

#

opaque falcon Making an oil bets with the issues in the Miiddle East, lol

Not funny at all, it's a serious concern for us. I can't say anything more.

#

April 2nd through 9th was absolute chaos too

#

A lot of us lost jobs that week

weak oxide Jun 22, 2025, 9:53 PM

#

opaque falcon That's pretty cool. Reminds of me options and Nassem Taleb, his books.

The Python for finance book was decent for me in the beginning but most of this is a lot of risk management which is tricky

opaque falcon Jun 22, 2025, 10:36 PM

#

weak oxide A lot of us lost jobs that week

I am sorry to hear that. I thought money would be made on the volitility.

weak oxide Jun 22, 2025, 10:36 PM

#

opaque falcon I am sorry to hear that. I thought money would be made on the volitility.

Yeah the ones who prepared properly did. There was some who just liked to go with the flow

opaque falcon Jun 22, 2025, 10:37 PM

#

weak oxide Yeah the ones who prepared properly did. There was some who just liked to go wit...

Yeesh. Sorry to hear that .

weak oxide Jun 22, 2025, 10:40 PM

#

opaque falcon Yeesh. Sorry to hear that .

But yeah let's see what else we can recommend

#

For your book

opaque falcon Jun 22, 2025, 10:43 PM

#

Thanks!

weak oxide Jun 22, 2025, 10:44 PM

#

Time series is one
probably regression problems
Classification problems (related to data science)

opaque falcon Jun 22, 2025, 10:44 PM

#

I am trying to make it as easy to understand and cover all the major areas.

#

Sweet, added

weak oxide Jun 22, 2025, 10:53 PM

#

I would recommend using some data sources like FRED where people can download themselves and try themselves instead of a pre picked data set that's generated. Like FRED got like 800,000 so you could spare 10 datasets to practice

#

For regression data there's some data sources at Data.gov( you'll have to clean it to create a tensor of course)

opaque falcon Jun 22, 2025, 11:05 PM

#

@eric any other topics?

weak oxide Jun 22, 2025, 11:17 PM

#

I'm kinda limited to data science stuff and finance
https://github.com/jialuechen/torchquant
This was a ok Pytorch quant package I had learned before the job. I don't know how you would implement it. You do economics so there could be some application somewhere

GitHub

GitHub - jialuechen/torchquant: PyTorch for Quantitative Finance : ...

PyTorch for Quantitative Finance : Payoffs are Activations - jialuechen/torchquant

#

I wished they updated this more though

#

Just as a note, if you ever use examples of Pytorch involving stock prices never forget to add a disclaimer

#

It's just a way to visualize the model using a data set that's easy for them to practice with

grand minnow Jun 23, 2025, 5:21 AM

#

Can anyone recommend a free cloud-hosted LLM Observability service? I need a way to properly monitor the inputs, processes and responses of the LLM to the users its interacting to.

near yoke Jun 23, 2025, 5:57 AM

#

how do I study the maths required for data science. Are there any book recommendations ?

minor salmon Jun 23, 2025, 7:49 AM

#

near yoke how do I study the maths required for data science. Are there any book recommend...

I am still not an expert in math, but for me it was/is a mixture of YouTube, blog posts and books – like for any other topic.

minor salmon Jun 23, 2025, 7:49 AM

#

near yoke how do I study the maths required for data science. Are there any book recommend...

https://amzn.eu/d/7QBML84

Essential Math for Data Science: Take Control of Your Data with Fun...

Essential Math for Data Science: Take Control of Your Data with Fundamental Linear Algebra, Probability, and Statistics

verbal oar Jun 23, 2025, 10:56 AM

#

if easy then maybe keras based?

agile cobalt Jun 23, 2025, 12:40 PM

#

grand minnow Can anyone recommend a free cloud-hosted LLM Observability service? I need a way...

Pydantic Logfire is pretty good imo, not sure what you mean by "processes" though

grand minnow Jun 23, 2025, 1:20 PM

#

agile cobalt Pydantic Logfire is pretty good imo, not sure what you mean by "processes" thoug...

Just wanna trace and see if the LLM is using the right tools or not using the tools when a user ask it of something. So far, Helicone doesn't work, even when I raised it with their support, it's still not fixed. I'm looking at trying LangSmith now and if that doesn't work, gonna try OpenTelemetry

ornate trellis Jun 23, 2025, 2:48 PM

#

can someone help me with the path I should follow for data engineer and scientist currently in 2 year and learning python

cerulean kayak Jun 23, 2025, 6:11 PM

#

Can I ask questions about ai theroy in this channel? ie questions that are not code based?

serene scaffold Jun 23, 2025, 6:24 PM

#

cerulean kayak Can I ask questions about ai theroy in this channel? ie questions that are not c...

Yes

cerulean kayak Jun 23, 2025, 6:38 PM

#

tldr; are all generative AIs based on language models?

So in Microsoft's *Introduction to AI concepts * they claim:

Key points to understand about generative AI include:
...

The ability to generate content is based on a language model...
Which I am skeptical of, because I doubt all generative AI models require training based on a language model.
So I look it up online and I find this:
Not all generative AI tools are built on LLMs, but all LLMs are a form of generative AI
Does this mean Microsoft is wrong in their claim?

serene scaffold Jun 23, 2025, 6:41 PM

#

cerulean kayak tldr; are all generative AIs based on language models? So in Microsoft's *Intro...

No, there are generative models that aren't language models.

#

The ability to generate content is based on a language model...
There's clearly a context to this. It doesn't mean language models are necessary for all forms of generation.

cerulean kayak Jun 23, 2025, 6:43 PM

#

I mean I'll post the entire lesson worth of text, for the class is free; I don't have to worry about copyright:

Key points to understand about generative AI include:

Generative AI is a branch of AI that enables software applications to generate new content; often natural language dialogs, but also images, video, code, and other formats.

The ability to generate content is based on a language model, which has been trained with huge volumes of data - often documents from the Internet or other public sources of information.

Generative AI models encapsulate semantic relationships between language elements (that's a fancy way of saying that the models "know" how words relate to one another), and that's what enables them to generate a meaningful sequence of text.

There are large language models (LLMs) and small language models (SLMs) - the difference is based on the volume of data and the number of variables in the model. LLMs are very powerful and generalize well, but can be more costly to train and use. SLMs tend to work well in scenarios that are more focused on specific topic areas, and usually cost less.

cerulean kayak Jun 23, 2025, 6:50 PM

#

serene scaffold > The ability to generate content is based on a language model... There's clearl...

wait, so are you saying Microsoft is not claiming all generative ai uses language models?
Because the quote is at the begining of the bullet point, so wouldn't the context be there?

agile cobalt Jun 23, 2025, 6:55 PM

#

cerulean kayak I mean I'll post the entire lesson worth of text, for the class is free; I don't...

side note: something being free to access does not necessarily means you don't have to worry about copyright, but for a discord conversation it does not matters anyway

That is more focused on giving non-technical users a high level explanation than being technically correct

The most common form of generative AI you'll see out there are LLMs, specially chatbots like ChatGPT, but things like Text-To-Image or Image-To-Video also count as Generative AI, and in some cases don't even include text as an input

alpine aspen Jun 23, 2025, 8:15 PM

#

Looking for general feedback on quality of post: https://dev.to/kylepena/gradient-descent-on-llm-input-space-a-modernbert-experiment-3053

DEV Community

Gradient Descent on Token Input Embeddings: A ModernBERT experiment

Input Embedding Space Gradients This is the first in a series of posts on the...

cerulean kayak Jun 23, 2025, 8:17 PM

#

agile cobalt side note: something being free to access does not necessarily means you don't h...

Ya and as to it being for beginers,
I always find that to be hard to judge when to take things as law and when to take things as "good enough", as you'd think that if you don't understand the basics you won't get the more advanced stuff.

grand minnow Jun 24, 2025, 3:46 AM

#

grand minnow Just wanna trace and see if the LLM is using the right tools or not using the to...

LangSmith works. But I found that if I wanna share the traces and logs with others, would have to start paying $39 per user. 🤔

south cape Jun 24, 2025, 4:01 AM

#

What is the AI

rich moth Jun 24, 2025, 4:10 AM

#

verbal oar notebooks are good for tutorial projects?

yes, they can be useful for a lot, but after consistent use they have their limitations but also quality of life improvements for testing.

#

You can also access a jupyter notebook remotely (local server) and use the processing power you own, remotely.. Which is big

#

Producing visuals, also awesome

grand minnow Jun 24, 2025, 5:04 AM

#

I asked my LLM, "How do I deal with a bully at school?". It's response:

Ah, dealing with a bully—like forgetting your password but in real life. Don’t worry, kiddo, we’ll hack this problem together! Here’s my dad-vice:

Stand Tall Like a WiFi Signal – Bullies often target those who seem unsure. Walk confidently, even if you don’t feel it yet. Fake it till you make it!
*(Bonus joke: Why did the WiFi break up with the router? Too much interference!)
Kill ‘Em With Kindness – Bullies expect anger or fear. Throw them off with a dad joke-level response:
"Wow, your insults are like a broken pencil—pointless."
Report Like a Software Bug – If it’s serious, tell a teacher, parent, or principal. Bullies are like pop-up ads—best handled by blocking them.
Buddy System™ – Bullies are less likely to mess with groups. Stick with friends like glue sticks to glitter (aka forever).
Remember: They’re the Glitch, Not You – A bully’s words say more about them than you. You’re the main character—don’t let bad NPCs ruin your game.

And if all else fails? Picture them in ridiculous underwear. Works every time. 😆 You got this, champ!

#

I love the broken pencil analogy lol

jaunty helm Jun 24, 2025, 6:57 AM

#

cerulean kayak I mean I'll post the entire lesson worth of text, for the class is free; I don't...

yeah no, generative AI definitely isn't only language models
for example diffusion and rectified flow models are very dominant in image generation and I doubt anyone would call these "language models"
and GANs existed before any of the image gen and chatgpt hype, ig they're suddenly erased from history now

winter portal Jun 24, 2025, 1:20 PM

#

I'm currently developing a recruiter portal that allows job postings and enables candidates to apply. I aim to implement an intelligent ranking system that evaluates how well each candidate matches a given job description.

To achieve this, I plan to use a large language model (LLM) to analyze candidate profiles and generate a score, a justification for the score, a list of matching criteria, and any mismatches. This data would be stored in the database at the time of application submission.

One important consideration is that if the job description is updated, all existing applications for that job would need to be re-evaluated by the model.

I’m looking for guidance on whether this is a sound approach for using LLMs to rank candidates intelligently. Is there a more efficient or cost-effective method to achieve this? Are there any strategies or best practices for reducing costs in this setup?

serene scaffold Jun 24, 2025, 1:23 PM

#

winter portal I'm currently developing a recruiter portal that allows job postings and enables...

I would first consider the ethics of delegating such a consequential decision to an AI.

winter portal Jun 24, 2025, 1:24 PM

#

serene scaffold I would first consider the ethics of delegating such a consequential decision to...

thats a fair point! I also thought of it at first, but either way machines already rank candidates right? if not via LLMs via custom overlap metrics, embeddings, etc.
If we were to prompt the LLM to avoid bias perhaps it would be ethical?

serene scaffold Jun 24, 2025, 1:25 PM

#

that being said, you don't want to ask an LLM "give this text a rating out of ten and tell me why you gave it that rating", because LLMs are prone to generate a random rating, and then invent reasons that fit that rating. rather than the other way around. you would want to frame it as "describe how closely this text meets expectations, and then based on that, give it a score out of 10"

serene scaffold Jun 24, 2025, 1:26 PM

#

winter portal thats a fair point! I also thought of it at first, but either way machines alrea...

what kind of bias? you could just not make information like the name of the applicant available to the LLM.

winter portal Jun 24, 2025, 1:26 PM

#

makes sense! are there any methods to cut down on costs with such a setup? Cause I can imagine this can burn through tokens easily..

serene scaffold Jun 24, 2025, 1:26 PM

#

winter portal makes sense! are there any methods to cut down on costs with such a setup? Cause...

Not really.

winter portal Jun 24, 2025, 1:26 PM

#

serene scaffold what kind of bias? you could just not make information like the name of the appl...

hmm, redacting personal information seems like a good idea!

serene scaffold Jun 24, 2025, 1:27 PM

#

serene scaffold that being said, you don't want to ask an LLM "give this text a rating out of te...

this part is very important.

winter portal Jun 24, 2025, 1:27 PM

#

serene scaffold Not really.

Sikes. earlier I built a system where it generated these "ai insights" on the fly, upon every search.
ig storing it in the DB once after analysing scales much better.

winter portal Jun 24, 2025, 1:28 PM

#

serene scaffold this part is very important.

I'll take note of that and prompt in the exact order, thank you so much!!

#

I was thinking maybe we could batch the analysis part instead of making LLM calls for each applicant individually.. will that cut costs maybe?

#

considering LLM providers have batch APIs that are cheaper most of the times..

serene scaffold Jun 24, 2025, 1:30 PM

#

all the LLMs I use are on-prem

winter portal Jun 24, 2025, 1:31 PM

#

we haven't officially launched yet. If this were other startups where the initial traction wouldn't be known, I wouldn't bother too much about the scale factor.
But we are launching to 20k/30k potential users directly in a conference and don't wany the bill to skyrocket..

#

Using gemini 2.5-flash-lite right now, that might help a bit!

#

But the most important thing I really wanted to know was, if these systems/ similar systems for ranking in other domains are in practice.. this is kinda new to me and I don't want to do anything wrong/ that doesn't scale..

lethal tangle Jun 24, 2025, 1:37 PM

#

hi , i wanted help in learning AI ML , some one has recommended me to go through some University lectures , but as they rush through things and do not write any code in python it is getting boring , so is there a like project based way to learn these things as a beginner .

opaque falcon Jun 24, 2025, 3:19 PM

#

weak oxide I'm kinda limited to data science stuff and finance https://github.com/jialuech...

How did you learn about AI?

buoyant vine Jun 24, 2025, 3:46 PM

#

winter portal I'm currently developing a recruiter portal that allows job postings and enables...

maybe hot take but using generative AI to be the main classifier is a terrible way of achieving this. Using it to doing some bootstrapping? maybe. But the bulk of what you want from this thing is better done with supervised learning and much smaller transformer models that just look for one specific thing, and then you have several of them looking for particular characteristics and aggregating them

weak mortar Jun 24, 2025, 4:35 PM

#

the concept seemed so meaningless to me that i didnt bother digging at all, AI that generates... generate with AI.. yeah duh, thats what it does. but ofcourse i was just being ignorant

serene scaffold Jun 24, 2025, 4:43 PM

#

weak mortar the concept seemed so meaningless to me that i didnt bother digging at all, AI ...

most AI doesn't generate stuff

weak mortar Jun 24, 2025, 5:05 PM

#

you put data into it and it generates an output

serene scaffold Jun 24, 2025, 5:05 PM

#

weak mortar you put data into it and it generates an output

that's not the same thing

weak mortar Jun 24, 2025, 5:05 PM

#

semantics 101

opaque falcon Jun 24, 2025, 6:20 PM

#

Trying to find a project to apply my studying machine learning and AI. Could anyone suggest some interesting things to apply the tech?

#

I am into:

entrepreneurship
economics
community

velvet ice Jun 24, 2025, 7:38 PM

#

Can someone recommend me a beginner level project for ML (I js learned linear and polynomial regression)

untold dove Jun 24, 2025, 8:15 PM

#

hey i need some advice in regards to training a large scale architecture that is simlar to the idea of a gpt but not really the same

im worried that im going to spend a ton of money cloud wise and train this thing and its going to fail i want someone to look over my architecture give me some pointers and advice if anyone would be willing to do that that would be amazing

opaque falcon Jun 24, 2025, 8:55 PM

#

serene scaffold that's not the same thing

any suggestions?

agile cobalt Jun 24, 2025, 8:57 PM

#

untold dove hey i need some advice in regards to training a large scale architecture that is...

no one can predict how well an architecture will work before testing it, but you should conduct small scale tests before throwing a lot of money at it

#

I would strongly recommend against trying to train something truly large yourself though, unless you are sure enough it'll be worth the investment to the point you don't need to ask in public servers

serene scaffold Jun 24, 2025, 9:23 PM

#

opaque falcon any suggestions?

please don't ping people to ask them to answer a question that isn't specific to them

opaque falcon Jun 24, 2025, 9:24 PM

#

serene scaffold please don't ping people to ask them to answer a question that isn't specific to...

This is the question: Trying to find a project to apply my studying machine learning and AI. Could anyone suggest some interesting things to apply the tech?

serene scaffold Jun 24, 2025, 9:25 PM

#

opaque falcon This is the question: Trying to find a project to apply my studying machine lear...

why do you keep pinging me to answer this? it's not a question that's specific to me, so it's rude to keep making me get notifications on all my devices that you want me to answer it.

opaque falcon Jun 24, 2025, 9:41 PM

#

serene scaffold why do you keep pinging me to answer this? it's not a question that's specific t...

I assumed you were a expert and a friendly community member. My apologies.

serene scaffold Jun 24, 2025, 9:42 PM

#

opaque falcon I assumed you were a expert and a friendly community member. My apologies.

This is just pinging etiquette. It's not unfriendliness.

waxen kindle Jun 24, 2025, 10:07 PM

#

opaque falcon I assumed you were a expert and a friendly community member. My apologies.

Look on kaggle with your keywords, you may find some good projects

young beacon Jun 25, 2025, 12:53 AM

#

Does anyone know any AI newsletter which talks about real world use cases that people are building with LLMs and not just model releases and other updates

gentle stone Jun 25, 2025, 9:20 AM

#

Hello everyone, I'm interested in data science and ai, are there any careers in this field that don't require a college degree?

lucid wren Jun 25, 2025, 9:54 AM

#

gentle stone Hello everyone, I'm interested in data science and ai, are there any careers in ...

Yes their r many career options

weak oxide Jun 25, 2025, 10:32 AM

#

opaque falcon How did you learn about AI?

I'll be honest, some of the best foundations of finance and statistics allowed me to enter AI.

weak oxide Jun 25, 2025, 11:23 AM

#

opaque falcon This is the question: Trying to find a project to apply my studying machine lear...

Probably try something economics related because you have done economics before right

#

Though I'm not personally a fan of some of the idea, you can probably do a Federal Reserve sentiment analysis with Pytorch

gentle stone Jun 25, 2025, 12:02 PM

#

lucid wren Yes their r many career options

yes, I think I like to Pursue data or Ai career, are you also pursuing a career in data science or ai btw?

lucid wren Jun 25, 2025, 12:09 PM

#

gentle stone yes, I think I like to Pursue data or Ai career, are you also pursuing a career ...

Yes I Am a beginner too learning ML

gentle stone Jun 25, 2025, 2:03 PM

#

I also learn a little about ML from free courses

serene scaffold Jun 25, 2025, 2:04 PM

#

gentle stone Hello everyone, I'm interested in data science and ai, are there any careers in ...

There are virtually none. I always encourage people to learn about things that interest them, but if you can't get a degree, you should not pursue a career in data science or AI.

verbal oar Jun 25, 2025, 2:51 PM

#

you can just reply without pinging

#

clouds

serene scaffold Jun 25, 2025, 2:59 PM

#

verbal oar you can just reply without pinging

yes

opaque falcon Jun 25, 2025, 3:21 PM

#

gentle stone Hello everyone, I'm interested in data science and ai, are there any careers in ...

From what I have seen, if you can do the work and can prove it, then having a college degree doesn't matter. It seems you will have to learn the math from Khan Academy or Coursera.

#

I wouldn't be discouraged. It just take some work to do. Finding a project you can apply your skills with people you like helps.

opaque falcon Jun 25, 2025, 3:25 PM

#

verbal oar you can just reply without pinging

thanks

serene scaffold Jun 25, 2025, 3:31 PM

#

I want to reiterate that there are only so many jobs available in data science and AI, and if a company gets more applications from degree holders than they can interview, they're not going to interview any of the ones without degrees. I wish it weren't this way. At my company, your application simply will not be considered if you don't have a degree, even if no one else were to apply.

opaque falcon Jun 25, 2025, 3:34 PM

#

gentle stone I also learn a little about ML from free courses

Which courses? What do you like about them or don't like about them?

opaque falcon Jun 25, 2025, 3:36 PM

#

gentle stone Hello everyone, I'm interested in data science and ai, are there any careers in ...

A college degree requirement depends on the country you live and the type of industry. Some industries don't care. Google and Tesla don't care if you have a college degree. My tip is to contribute to open source libraries of popular tools and do things that stand out.

gleaming temple Jun 25, 2025, 3:45 PM

#

I'm currently working with a dataset for an internship, but I just don't know what direction to take at this time

#

It's climate/environment related

#

📎 mv_surface_temperatures_wide.parquet

#

The task is to improve the cleanliness of the data

#

right now there are a lot of inconsistencies/abnormalities within the set

#

I've messed with the metadata a little bit (associated seasons and country with each input), as I am on the sub-team working with metadata

#

But I think I want to get involved with more analytical stuff, and that's why I'm here lol

jaunty helm Jun 25, 2025, 4:05 PM

#

gleaming temple But I think I want to get involved with more analytical stuff, and that's why I'...

what are you asking then exactly? ways to detect anomalies in data?

gleaming temple Jun 25, 2025, 4:05 PM

#

jaunty helm what are you asking then exactly? ways to detect anomalies in data?

Yeah I think so

#

I apologize for the inital vagueness of the question

jaunty helm Jun 25, 2025, 4:11 PM

#

gleaming temple Yeah I think so

well then the word you're looking for is anomaly detection
I assume it's a geospatial time series judging from the name, so you can look that up
last time I did stuff on time series I tried using the matrix profile which worked surprisingly well despite how simple it seems (though time series are wild so may not work for you)

gleaming temple Jun 25, 2025, 4:12 PM

#

jaunty helm well then the word you're looking for is anomaly detection I assume it's a geosp...

Thanks for the information, I'll do some research on that

#

I'm mostly new to this sort of stuff as I'm still in high school, but I'm also quite eager to learn

halcyon yew Jun 25, 2025, 5:43 PM

#

I tried tensorflow, and the model i trained, was Confidently Wrong 1000% of the time

warped lintel Jun 25, 2025, 7:57 PM

#

i am trying to use the inference package from roboflow and when i attempt to run my model i get the following error.

ImportError: cannot import name 'YOLACT' from 'inference.models' (C:\Users\sims4\AppData\Local\Programs\Python\Python310\lib\site-packages\inference\models\__init__.py) im not sure what to do i have already installed inference-gpu[tensors], inference-gpu[yolo-world] and inference[gaze]. I also have cuda and cudnn configured correctly

opaque condor Jun 25, 2025, 8:45 PM

#

where would getting many diffrent images for a convolution network

agile cobalt Jun 25, 2025, 8:49 PM

#

look for open datasets or scrape TPF_02_Shrug
ideally the former

opaque condor Jun 25, 2025, 8:54 PM

#

I cant see anything on open datasets

opaque condor Jun 25, 2025, 9:05 PM

#

agile cobalt look for open datasets or scrape <:TPF_02_Shrug:968285941892534372> ideally the ...

im looking for finch data set

agile cobalt Jun 25, 2025, 9:12 PM

#

opaque condor im looking for finch data set

that is pretty specific

you might want to try creating small dataset manually, then training a classifier to filter though a generic bird images dataset

opaque condor Jun 25, 2025, 9:15 PM

#

I am how many photos should I get for each label and sub-label

#

Main-Label:the photo\bird

Sub-label:if the bird is male or female

agile cobalt Jun 25, 2025, 9:17 PM

#

I don't think you can reliably predict the gender from a image? at least not for all species

opaque condor Jun 25, 2025, 9:19 PM

#

I know but for cardnels, bluejays, humming birds,
Chicken

opaque condor Jun 25, 2025, 10:00 PM

#

agile cobalt I don't think you can reliably predict the gender from a image? at least not for...

but who knows

opaque condor Jun 25, 2025, 10:51 PM

#

agile cobalt I don't think you can reliably predict the gender from a image? at least not for...

A computer was able to without being told to found the biological sex of a photo of their eye

serene scaffold Jun 25, 2025, 10:55 PM

#

opaque condor A computer was able to without being told to found the biological sex of a photo...

Can you rephrase that? I don't quite understand what you mean