#data-science-and-ml

1 messages · Page 141 of 1

violet gull
#

youtube or looking up sources

warm iron
#

Hi! is it okay to talk about Machine Learning here?

scarlet owl
#

read channel description

warm iron
#

has anyone ever had experience working with raw ECG data?

proper crag
#

@lyric furnace mate just keep learning python understand when you can utilize loop, if else,aggregation operations, etc...even python hv libraries just trust me ...ive seen ml code that is using for loop and the code is something like 200+ lines regardles after utilizes libraries

lapis sequoia
#

minimal plot of different activations vs accuracy

(mnist, 2 layer perceptron.)

lapis sequoia
#

so i red up and duckdb is optimized for analytical queries on giant databases whereas sqlite is optimized for single writes

#

which is why duckdb is so slow for logging

#

but nested dicts are still 10 times faster, but I haven't figured out and thus tested how to do threaded or async database writes

#

when they remove gil it will be much easier

indigo wing
#

Hey, anyone knows about sematic search engines?

toxic mortar
#

What are your go-to methods to evaluate your clssification model performance on huge unseen dataset?

lapis sequoia
lapis sequoia
#

interesting, i haven't seen it used, and read a few posts yesterday that didn't seem too positive

#

like basically saying it reduces to tanh the useful part

#

the periodicity didn't really help if i understood correctly

#

i had the same results as relu with sine in segmentation but maybe its making some tasks harder

#

also interesting function modulus (abs(output)) haven't tried it

#

interesting

#

also I tried an activation function that takes maximum among first half of the channels and second half of the channels, and minimum, and concatenates them, and it worked, although wasnt as good as relu

#

its crazy how you can give it any weird model and it will find how to use that model

#

by it i mean gradient descent

#

that repo is piece wise linear units right?

#

yeah

#

with any number of segments, that's somewhat new to me

lapis sequoia
#

(...) While sinusoidal activation functions have been successfully used for specific applications, they remain largely ignored (...)
[we] describe how the presence of infinitely many and shallow local minima emerges from the architecture.
(...) by showing that for several network architectures the presence of the periodic cycles is largely ignored (...)
etc.

may not be the best paper though (and may be incorrect.), just one i found.

runic parcel
#

i hears Cyc is a knowledge database, but can i use it to train my model? how can i get the code?

lapis sequoia
#

can anyone help me with this error

serene scaffold
lapis sequoia
#

i think your generator may have ran out of data

serene scaffold
lapis sequoia
lapis sequoia
serene scaffold
#

!paste

arctic wedgeBOT
#
Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

lapis sequoia
#

you need to make sure that the steps _ execution *epochs is less than the n of batches, it's a common error, just read the docs for PyDataset

lapis sequoia
#

the generator values are called once (till it runs out of data.), so that's why. in the case of tensorflow you can use .repeat idk if there is anything like that for pydataset.

#

import tensorflow as tf
from tensorflow.keras.preprocessing.image import ImageDataGenerator

tf.random.set_seed(42)

#preprocess the data (pixels in the range of 1 to 255)
train_datagen = ImageDataGenerator(rescale = 1./255)
valid_datagen = ImageDataGenerator(rescale = 1./255)

train_dir = '/content/pizza_steak/train'
test_dir = '/content/pizza_steak/test'

import data from directories and turn them into batches

train_data = train_datagen.flow_from_directory(directory = train_dir, batch_size=32,
target_size=(224, 224), class_mode="binary", seed=42)

test_data = valid_datagen.flow_from_directory(directory = test_dir, batch_size=32,
target_size=(224, 224), class_mode="binary", seed=42)

Build a CNN model

model_1 = tf.keras.models.Sequential([
tf.keras.layers.Conv2D(filters=10, kernel_size=3, activation='relu', input_shape=(224,224,3)),
tf.keras.layers.Conv2D(10, 3, activation = 'relu'),
tf.keras.layers.MaxPool2D(pool_size=2, padding='valid'),
tf.keras.layers.Conv2D(10, 3, activation = 'relu'),
tf.keras.layers.Conv2D(10, 3, activation = 'relu'),
tf.keras.layers.MaxPool2D(2),
tf.keras.layers.Flatten(),
tf.keras.layers.Dense(1, activation='sigmoid')
])

#compile our model
model_1.compile(loss='binary_crossentropy',
optimizer = tf.keras.optimizers.Adam(),
metrics=['accuracy'])

history_1 = model_1.fit(train_data,
epochs=5,
steps_per_epoch = len(train_data),
validation_data = test_data,
validation_steps = len(test_data))

serene scaffold
lapis sequoia
#

i think this could work (not fully sure.):

steps_per_epoch = (len(train_data)//(batch_size))//epochs),

lapis sequoia
#

did you add //epochs

lapis sequoia
#

i think that steps per execution * batch size have to be less than the training data

#

the same applies to validation steps

#

actually you could try both with the number 30 and check @lapis sequoia ?

#

otherwise you may have to ask either in a separate question in #1035199133436354600 or maybe in a forum (or wait for others to help)

lapis sequoia
#

no sorry, by passing into steps per execution a number smaller than the number of batches generated (not the batch size.) @lapis sequoia
same applies to validation_steps, using a number smaller than the number of test batches generated (should actually be less than len(data)//batch_size) for each case, if i understand correctly.

lapis sequoia
#

no problem, you've got several other errors

#

@lapis sequoia can you please explain me what was the issue with the previous code?

#

i think it expects an Input layer

#

yeah let me see if i can find a thread explaining it, i can't now

lapis sequoia
#

if you ever use tf.data.Dataset instead, it's got this option (extracted from link.):

`If you're using a tf.data.Dataset, you can also add the repeat() method, but be careful: it will loop indefinitely (unless you specify a number)

#

can happen is using XLA (incomplete batches break it), then use drop_remainder=True, not your case though.

oblique isle
#

Guys what s better to analyze Academic papers ? Claude or GPT 4 ?

lapis sequoia
#
lapis sequoia
lapis sequoia
#

summary on sine vs tanh (paper is here https://openreview.net/pdf?id=Sks3zF9eg), pretty interesting read

tldr; sine seems better for intuitively periodic tasks (like addition.), and comparable to tanh in std cases
(not surprising that is kinda works in many tasks, but not due to periodicity.)

lapis sequoia
#

is random grid search just straight up better than quasi random search?

#

because it has the lowest discrepancy possible

#

and quasi random search is better than random search

#

so random grid search is the best?

unkempt wigeon
#
#===[imports]===#
import numpy as np
#===============#

X = np.array([0.1, 0.2, 0.3, 0.4])


converted_data0=np.asarray(X)

print(converted_data0)
unkempt wigeon
#

how can i get the array to collect to the data andrun it through the network?

serene scaffold
#

and what network?

lapis sequoia
unkempt wigeon
#

the nerons

serene scaffold
unkempt wigeon
#

im working on the array to increse the speed this was to test the use before joing the mail code

serene scaffold
unkempt wigeon
#

no

serene scaffold
#

There is no reason to have converted_data0=np.asarray(X) in your code. X is already an array.

#

Try writing more code that represents layers of a network, and write code to send an array through the network.

unkempt wigeon
#
#===[imports]===#
import sys
import numpy as np
import matplotlib
#===============#

#===[neuron network]===#
np.random.seed(0)

X = [[1, 2 ,3,2.5],
    [2.0,5.0,-1.0, 2.0],
    [-1.5, 2.7, 3.3, -0.8]]

class Layer_Dense:
    def __init__(self, n_inputs, n_neurons):
        self.weights =0.10 * np.random.randn(n_inputs, n_neurons)
        self.biases = np.zeros((1, n_neurons))

    def forward(self, inputs):
        self.output = np.dot(inputs, self.weights) + self.biases

layer0 = Layer_Dense(4,5)              
layer1 = Layer_Dense(5,9)
layer2 = Layer_Dense(9,4)
layer3 = Layer_Dense(4,2)


layer0.forward(X)
layer1.forward(layer0.output)
layer2.forward(layer1.output)
layer3.forward(layer2.output)

print(layer3.output)
serene scaffold
unkempt wigeon
#

yes

#

[[ 5.86410565e-03 4.20239779e-05]
[ 4.60184756e-03 2.41869992e-03]
[ 1.37659937e-02 -1.03951813e-02]]

#

my apoliges

serene scaffold
unkempt wigeon
#

yes its the outputs combined from the neurons getting all posible outputs from the set inputs. my apoliges

serene scaffold
unkempt wigeon
#

segilopa ym wonk t'nod i

serene scaffold
#

what?

main fox
#

I thought it was another language but it's just reversed lol

serene scaffold
unkempt wigeon
#

no my apoliges

serene scaffold
unkempt wigeon
#

my apoliges

serene scaffold
unkempt wigeon
#

make it be able tolearn colors from images and other things too

serene scaffold
serene scaffold
#

why is that color the learned output for that image?

unkempt wigeon
#

fox faces and the color green to start green because you recommended it and foxes as there faces are unece in shape and perportions

unkempt apex
#

what the hell is going on !!😂

serene scaffold
unkempt wigeon
#

maybe i should teach it colors first my apoliges

unkempt apex
#

bruhhh....

serene scaffold
unkempt apex
#

bro is high on something I guess ..

serene scaffold
unkempt apex
#

and in that post, he also apologies..

unkempt wigeon
#

list of colors

#

primarly green

#

secondary blue

#

third white

#

fourth is brown

#

my apoliges

unkempt apex
#

bruhhh..

unkempt wigeon
#

my apoliges

unkempt apex
#

so by this way, no one will help you

#

and only apologies you!

unkempt wigeon
#

im sorry

#

i can figure out the rest i just need help help with one color my apoliges

#

@unkempt apex

unkempt apex
#

just post the questions properly, so other guys will look into it

#

without apologies

unkempt wigeon
#

I just need help with one color input and then I can use a different output later

serene scaffold
unkempt wigeon
#

im sorry

serene scaffold
unkempt wigeon
#

for bothering you

rough grove
#

should i learn pytorch or tensorflow first

lapis sequoia
twin acorn
#

ive been ignored for houirs any help is appreciated

serene scaffold
serene scaffold
# rough grove should i learn pytorch or tensorflow first

pytorch and tensorflow are two libraries that do the same thing. it's not a foregone conclusion that you need to know both. I recommend focusing on one.

but I also recommend learning a lot of other things before you get anywhere near neural networks.

rough grove
serene scaffold
rough grove
serene scaffold
main fox
# serene scaffold I work in language ai

Research or industry? Also, how much regex do you use? I recently found myself with a project that initially sounded like it would need NER but regex worked very well.

unkempt apex
rich river
#

any data mining projects recommended?

faint quail
#

whats the point of data mining/hoarding

indigo wing
# faint quail whats the point of data mining/hoarding

to gather data, to make sense of unstructured data mostly. Like whether you require that column on your dataset, example: you want to find avg height of 11-21 yrs old, you take a lot of data that contains their names, age, sex, bmi, address etc. Now which all you want, what's the dtype of the data, do you need to create more columns? This is looking mostly data mining

covert cave
#

hii, how can I solve this error :FileNotFoundError: [Errno 2] No such file or directory: 'C:\programs\anaconda3\Lib\site-packages\matplotlib\backends\web_backend\js\mpl.js'
for this code :%matplotlib notebook
plt.plot(y_test,label='Real values')
plt.plot(california_y_predicted,label='guess values')
plt.legend();

lapis sequoia
#

this paper has a lot of cool stuff with activations, if anyone wants to waste their time, certainly llm s may summarise it though.
https://arxiv.org/pdf/1710.05941

paper garnet
#

is there anyone knows machine learning libraries like Tensorflow, pyTorch, Scikit-learn #data-science-and-ml

lapis sequoia
#

are people here more in the camp of illusionists, materialists, reductionists, panpsychists, dualists,... ?

spare forum
lapis sequoia
lapis sequoia
#

but I havent read him

#

i only red parfit

#

I just think if you are a materialist and not illusionist than you cant not be a dualist

lapis sequoia
lapis sequoia
#

havent red anything else on phil of mind

#

i thought the personal identity chapter from reasons and persons was good because i agreed with it

lapis sequoia
lapis sequoia
#

actually he invented the teleportation paradox

#

and his other view is utilitarianism and he also has a whole bunch of very interesting and weird paradoxes even though i don't care too much about phil of ethics

#

that's cool, what are your main areas of interest @lapis sequoia ?

lapis sequoia
lapis sequoia
lapis sequoia
lapis sequoia
#

but can still be good

lapis sequoia
sullen marsh
#

Is there anyone can provide a learning path of AI/ML engineer from zero to hero?

fiery bane
rich river
fiery bane
#

like, find some topic that you like, sports, movies, etc, and then do data mining on that topic

serene scaffold
serene scaffold
viscid socket
#

Does anyone know somewhere that I can download/mine large amount of resumes? I am thinking of making an anonymous resume dataset for SWOT analysis

lapis sequoia
hushed canopy
#

Hi guys. I need books (or other resources) to learn Data Structures and Algorithms.
Please recommend.

lapis sequoia
#

dropout 4all

serene grail
# lapis sequoia dropout 4all

So is dropout commonly used? From "it's used in basically every NN" to "it's almost never used", how common is it?

serene scaffold
lapis sequoia
#

it may not work well with ReLUs in very deep networks, but im unsure whether this is fully established.

serene grail
#

I see, thank you both

runic parcel
lapis sequoia
green herald
#

Hello, I'm in search for a DE mentor. Is a good place to ask?

fallow frost
#

how do I even anwser to:
"What is your experience managing batch and incremental data ingestion processes?"

#

is incremental live data?

green herald
#

I don't know much. I know Python well but not much pipes and data flows.

green herald
#

As I see, I need to know SQL well and be able to work with ETL in the cloud.

spare forum
unkempt apex
#

!paste

arctic wedgeBOT
#
Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

unkempt apex
#

is this good U-Net ?

#

because confuse about output layer

unkempt apex
#

yeah updating..

#

?

#

and also this one
blue arrows represent conv

#

yeah like that!

#

that paper is really nice.

unkempt apex
#

dataset is taking so much time to upload

#

on kaggle

#

what if I restart my session, will that 4GB dataset will it gone?

#

shit then what's the best way?

#

I have already

#

download in ka ggle?

#

how>?

smoky basalt
#

where should i start i need help, i already know intermediate python and know custom tkinter i need some help in starting projects in data science and ai

green herald
#

I don't think you can have a national flag icon.

spring field
lapis sequoia
#

I think I might have came up with a machine learning concept, I don't think I heard this concept anywhere else.

Concept: The machine gets a bunch of information and if the information if relevant to the task, it will store and keep the info. Else if it isn't relevant to the task, it will save it just in case it's useful for another task. Else, it will delete the info as it's not needed.

green herald
spring field
#

I'm also not sure how that's related to machine learning, the machine learning would be determining whether information is relevant or not

lapis sequoia
# spring field between relevant and irrelevant what is the "else"

For example, lets say your friend tells you he picked up a pen. It's information but it's extremely useless.
There are 3 sections.
Useful - Useful to the task
Junk - Stores the info, in case if it's relevant to another task
Trash - Completely useless information

I think this concept can go well with reinforcement learning. Dealing with the information efficiently..?

Sorry, I don't know much about machine learning, but the concept of it intrigues me.

#

sorry, i wasnt done explaining

spring field
#

There are 3 sections.
Useful
Junk

I'm not quite following
we're back to two states, initially you implied at least 3 different states (relevant, irrelevant, else (which isn't really possible, since the other two states should cover everything already))

smoky basalt
#

im learning tensors

#

i learnt how to make tensors 😂

spring field
lapis sequoia
spring field
#

it either is useful or it is not

lapis sequoia
#

True, true

#

Then I would need a system that can detect fake information

#

Because in that scenario, fake information can be relevant.

left tartan
rich river
lapis sequoia
# left tartan I think you're asking: how can I detect which variables are significant and whic...

Yes.
Although, I was thinking of storing all it's knowledge in a list.

A method I thought of to get rid of the fake information is to put the program through a test and see if it can successfully complete it with no errors. Once it passed, it will keep the information.
Though, I would constantly need to create a test.

That's why I need to come up with an efficient method that can make sure the machine doesn't store false information.

I based this upon how we learn. Let's say we read a book, we absorb all it's information and we store it as useful and good. And when we encounter fake info, we can just dismiss it with our knowledge from the book and disprove it.

But I'm still struggling on how this method would be useful in.

Sorting data?

left tartan
#

Fake is probably not the word you mean then; fake means (to me) inaccurate or misleading data, vs 'irrelevant' (noise).

lapis sequoia
#

Yeah. Sorry about that. Got a little off track.

#

This honestly sounds like it can be used in sorting data

#

I wish I knew more about machine learning. Anyone got any resources I can use? Thanks.

robust jungle
#

does anyone have any tips on making template matching characters more reliable? I want to be able to identify characters in a game menu, which has a consistent font. Currently I'm using cv2.matchTemplate alongside a collection of rendered characters in that font. To my eye the characters look to be about the same size as the ones in the image, and I'm using Image.convert to make sure the colors match. Any ideas?

fiery bane
iron sparrow
#

Thanks for passing this along

runic parcel
#

How is Geospy Ai model trained? What data did they use and how can it be done?

rigid timber
#

can anyone help me find a pre trained model for a medical chatbot

unkempt apex
#

there are some who predict diseases based on symptons

rigid timber
wise bane
#

without getting too deep into what i want to do, its basically a "machine learning algorithm" that can differentiate between slides of blood with cancer and without cancer by using a control data set and a data set that has cancer, how can i achieve this?

unkempt apex
#

got this while loading dataset using DataLoader

spare forum
#

Show the code too uh

#

torch Dataloader?

#

Seems like you passed empty data or idk

unkempt apex
rigid timber
runic parcel
raw pasture
#

Hey guys who is good in machine learning t help me with a project.Anyone

serene scaffold
raw pasture
#

okay

lapis sequoia
lapis sequoia
unkempt apex
lapis sequoia
#

alr, sounds good

unkempt apex
#

again this stupid error

although I am directly making notebook using the dataset

unkempt apex
spare forum
#

?

#

asking a question because you droped the error with 0 code

unkempt apex
#

@spare forum the code is in pic

#

do you need that class code of how I am loading data?

spare forum
#

it's with datasets.ImageFolder from torchvision no ? or handmade

unkempt apex
#

the dataset is inherited with Dataset from torch.utils.data

spare forum
#

check result maybe idk

#

I don't really know the problem

#

I would do like from torchvision import datasets then the code is very similar like datasets.ImageFolder(root="..." , transform = train_transform)

#

but something went off with this I guess

unkempt apex
#

the train images was .jpg and I was checking for .png😂

spare forum
#

☠️

unkempt apex
#

so now, the project is road extraction from satellite images
where
/train -> satellite images and masks(label)
/test -> sat images
/valid -> sat images

#

so how can I train my model?>
because we can't calculate validation loss as there are no masks for to vaildate and insimple even compare

faint quail
#

how to do backpropagation with tensorflow

this is what I have so far

class Conv2d(Layer):
    def __init__(self, depth, kernel_shape=[3, 3], stride=1, variance="He"):
        self.kernel_shape = np.array(kernel_shape)
        self.variance = variance
        self.depth = depth
        self.stride = stride

    def forward(self, input_activations, training=True):
        output_activations = self.biases.copy()
        # for i, kernels in enumerate(self.kernels):
        #     for kernel, channel in zip(kernels, input_activations):
        #         output_activations[i] += scipy.signal.correlate2d(channel, kernel, "valid")

        start_time = time.time()

        output_activations += tf.nn.conv2d(
            input_activations.reshape(*input_activations.shape, -1).T, 
            np.flip(self.kernels.T, (0, 1)), 
            strides=[1, 1, 1, 1], 
            padding='VALID'
        )[0].numpy().T

        end_time = time.time()

        if training:
            self.output_activations = output_activations

        return output_activations

    def backward(self, input_activations, node_values):
        new_node_values = np.zeros(input_activations.shape)
        kernels_gradient = np.zeros(self.kernels.shape)

        for i, (kernels, kernel_node_values) in enumerate(zip(self.kernels, node_values)):
            for j, (image, kernel) in enumerate(zip(input_activations, kernels)):
                
                kernels_gradient[i, j] = scipy.signal.correlate2d(image, kernel_node_values, "valid")
                new_node_values[j] += scipy.signal.convolve2d(kernel_node_values, kernel, "full")

        kernels_biases_gradient = node_values
        return new_node_values, [kernels_gradient, kernels_biases_gradient]

The forward pass is really fast, but the full convolutional operation is really slow and I can't figure out how to write it using tensorflow which is much faster

lapis sequoia
agile anvil
faint quail
# faint quail how to do backpropagation with tensorflow this is what I have so far ```py cla...

nvm I figured it out

    def forward(self, input_activations, training=True):
        output_activations = self.biases.copy()
        output_activations += tf.nn.conv2d(
            input_activations.reshape(*input_activations.shape, -1).T, 
            np.flip(self.kernels.T, (0, 1)), 
            strides=[1, 1, 1, 1], 
            padding='VALID'
        )[0].numpy().T

        if training:
            self.output_activations = output_activations

        return output_activations

    def backward(self, input_activations, node_values):
        new_node_values = np.zeros(input_activations.shape)
        kernels_gradient = np.zeros(self.kernels.shape)

        new_node_values = tf.nn.conv2d_backprop_input(
            [1, *input_activations.shape[::-1]],
            filters = self.kernels.T,
            out_backprop = node_values.reshape(*node_values.shape, -1).T,
            strides = [1, 1, 1, 1],
            padding = "VALID",
        ).numpy()[0].T

        kernels_gradient = tf.nn.conv2d_backprop_filter(
            input_activations.reshape(*input_activations.shape, -1).T,
            self.kernels.shape[::-1],
            out_backprop = node_values.reshape(*node_values.shape, -1).T,
            strides = [1, 1, 1, 1],
            padding = "VALID",
        ).numpy().T

        kernels_biases_gradient = node_values
        return new_node_values, [kernels_gradient, kernels_biases_gradient]
merry mica
#

@lapis swift

lapis sequoia
#

quite a beautiful theorem, the universal approximation theorem in 2 short paragraphs

#

Any continuous fn in a subset of extended euclidean space can be approximated by a 1 -hidden- layer neural network; with infinite neurons.
Interestingly, many theorems cite that the activation fn must be non-polynomial, which many papers seem to ignore (they test polynomials fn and fail.).

mild bear
#

Hello, I am currently working on a research project and I've run into a problem with my minimax algorithm. Some backstory, the project is aims to integrate minimax strategies into the selection phase of the MCTS algorithm implemented in Michael Hu's AlphaZero "clone/model". The MCTS used in AlphaZero is a bit different from a traditional MCTS algorithm is these ways: 1.) After the search reaches a leaf node, there is no rollout. Instead, AlphaZero uses the neural network to evaluate the board position and uses that as an estimated game result to update the statistics in the search tree.
2.) When expanding a leaf node, all children are expanded in a single operation, rather than the standard MCTS, which expands one child at a time. This means that after node expansion, a leaf node immediately becomes fully expanded.
3.) AlphaZero uses a slightly different UCT algorithm to select the best child during the selection phase, which incorporates the prior action probabilities from the output of the neural network. There's a lot of code but main issue I'm having is, my minimax function for some reason doesn't work, it does not select the correct max or min values based on the evaluation value gotten at the terminal state

#

The minimax function with alpha-beta pruning:

#
def minimax(
        env,
        node: Node,
        depth: int,
        alpha: float,
        beta: float,
        maximizing_player: bool,
        eval_func: Callable[[np.ndarray], Tuple[Iterable[np.ndarray], float]]
) -> float:

    # Base case: if we reach the maximum depth or the node is terminal (not expanded)
    if depth == 0 or not node.is_expanded:
        # Use the environment's observation and eval_func for terminal state evaluation
        observation = env.observation()  # Get the observation from the environment
        _, value = eval_func(observation)

        return value

    # Get the legal moves (i.e., child nodes that are expanded)
    legal_moves = np.where(node.child_N > 0)[0]

    if maximizing_player:
        max_eval = float('-inf')
        for move in legal_moves:
            child_node = node.children[move]
            eval = minimax(env, child_node, depth - 1, alpha, beta, False, eval_func)
            max_eval = max(max_eval, eval)
            alpha = max(alpha, eval)
            if beta <= alpha:
                break  # Beta cut-off
        return max_eval
    else:
        min_eval = float('inf')
        for move in legal_moves:
            child_node = node.children[move]
            eval = minimax(env, child_node, depth - 1, alpha, beta, True, eval_func)
            min_eval = min(min_eval, eval)
            beta = min(beta, eval)
            if beta <= alpha:
                break  # Alpha cut-off
        return min_eval
lapis sequoia
#

i wish 5+ lines code blocks were collapsed by default

mild bear
#

could i do that?

lapis sequoia
#

i don't think so, not your fault at all; you can paste a link but then less people would see it, so it's fine :-)

mild bear
#

The selction function:

#
def best_child(
        env,
        node: Node,
        legal_actions: np.ndarray,
        c_puct_base: float,
        c_puct_init: float,
        child_to_play: int,
        eval_func: Callable[[np.ndarray], Tuple[Iterable[np.ndarray], Iterable[float]]],
        alpha: float = 0.5,
        minimax_depth: int = 2,
) -> Node:
    
    if not node.is_expanded:
        raise ValueError('Expand leaf node first.')

    ucb_scores = -node.child_Q() + node.child_U(c_puct_base, c_puct_init)

    # Initialize the minimax scores for legal actions
    minimax_values = np.full_like(ucb_scores, fill_value=-9999.0, dtype=np.float32)

    # Apply minimax to the legal actions only
    for move in range(len(legal_actions)):
        if legal_actions[move] == 1:
            if move in node.children:
                minimax_values[move] = minimax(env, node.children[move], minimax_depth, float('-inf'), float('inf'),
                                               node.to_play == 1, eval_func)
            else:
                # If the child node does not exist, treat it as a leaf with no Minimax value
                minimax_values[move] = 0

    # Combine the UCB scores and Minimax values with a weighted sum
    combined_scores = (1 - alpha) * ucb_scores + alpha * minimax_values

    # Exclude illegal actions by setting the combined scores to -9999
    combined_scores = np.where(legal_actions == 1, combined_scores, -9999)

    # Select the move with the highest combined score
    move = np.argmax(combined_scores)

    assert legal_actions[move] == 1

    if move not in node.children:
        node.children[move] = Node(to_play=child_to_play, num_actions=node.num_actions, move=move, parent=node)

    return node.children[move]
#

Thanks in advance for any help🙏

verbal oar
#

hmm if classical usage of pca was face recognition
and if pca is dimensionality reduction and equivalent is autoencoder so autoencoders are used for face recognition?

#

I meant extracting features with pca

#

hmm but with neural networks here is no need for feature extraction

quaint rivet
#

stuck at this error

#
ModuleNotFoundError                       Traceback (most recent call last)

<ipython-input-3-87b76c9d6778> in <cell line: 12>()
     10 
     11 from mrcnn.config import Config
---> 12 from mrcnn.model import modellib, utils

/content/mrcnn/model.py in <module>
     21 import keras.backend as K
     22 import keras.layers as KL
---> 23 import keras.engine as KE
     24 import keras.models as KM
     25 

ModuleNotFoundError: No module named 'keras.engine'
#

i'm trying to use mask rcnn code. But i don't know why it's giving me this error

deep iris
#

What is Batching of LLM Jobs, How Can It Reduce LLM Inference Cost, and How Can It Help Overcome Challenges Like Rate Limiting and GPU Utilization?

In this article, I explained all the above concepts. Please have a read and let me know your thought.
https://blog.cuminai.com/unlocking-the-power-of-job-batching-transforming-ai-workloads-2220b8c05e4f

Medium

Understanding what is LLM batching API, How it can be helpful? what are the different use cases of it? What can be possible cost saving?

lapis sequoia
#

maybe that means that all components of C, W, b can be found?

#

no, the fn just needs to be continuous

#

that was my 1st interp. but i think it just means that sigma would get undefined constants multiplying it

#

if it means that though, it makes sense that it's calling it sigma, and those were proven much later

#

ReLU, GeLU and so on, in fact some were proven only recently!

#

and also for discontinous functions (2023)

#

it's the form most often quoted; the 1st theorem was proven only for sigmoid iirc

#

it was extended to relu etc

#

Also, certain non-continuous activation functions can be used to approximate a sigmoid function, which then allows the above theorem to apply to those functions. For example, the step function works. In particular, this shows that a perceptron network with a single infinitely wide hidden layer can approximate arbitrary functions.

#

true, i thought it was for relus; those were proven later, i may get the paragraph

#

are you on the wikipage?

#

im not sure why, but the first was proven for sigmoids, this is the line:

The first examples were the arbitrary width case. George Cybenko in 1989 proved it for sigmoid activation functions
the paper is paid though

#

maybe but they can be used though, it's just proven later on from what i read there, then that's why we use XLUs

#

yes, plus people use x^2 as well from what i understand

#

you may need to check that paper, kinda funny to put it under a paywall since it's got 30K citations, well maybe thats why

#

for me visually, it makes sense that ReLUs can approximate any fn, more than sigmoids

cinder tangle
#

Hi,
I am working on a small project related to RAG and am stuck (Apparently cuz I don't know much)

I used mxbai-embed-large as embeddings and Chroma db as Vector store all goes well to this point.

Issue: When I try to retrieve data with similarity threshold it returns 0 docs and without threshold and k it always returns 4 docs no matter the query.
What is it that I am doing wrong?
Here my Code:

Vector Store Creation File:

# Load Docs and then store embeddings in the Chroma DB
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.document_loaders import PyMuPDFLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_chroma import Chroma

embeddings = OllamaEmbeddings(
    base_url="http://43.204.231.131:11434",
    model="mxbai-embed-large",
)

loader = PyMuPDFLoader("./data/aliceShort.pdf")
data = loader.load()
# print(len(data))

text_splitter = RecursiveCharacterTextSplitter(
    # Set a really small chunk size, just to show.
    chunk_size=300,
    chunk_overlap=100,
    length_function=len,
    add_start_index=True,
)

chunks = text_splitter.split_documents(data)
print(f"Split {len(data)} documents into {len(chunks)} chunks.")


db = Chroma.from_documents(chunks, embeddings,persist_directory="./chroma_langchain_db")

query = "Who is Alice?"
docs = db.similarity_search(query)
print(docs[0].page_content)

Query File:

from langchain_community.embeddings import OllamaEmbeddings
from langchain_chroma import Chroma

embeddings = OllamaEmbeddings(
    base_url="http://65.2.37.27:11434",
    model="mxbai-embed-large",
)
db = Chroma(persist_directory="./chroma_langchain_db", embedding_function=embeddings)
query_text="Who is Alice?"
retriever = db.as_retriever(
    search_type="similarity_score_threshold", search_kwargs={"score_threshold": 0.1})
docs = retriever.invoke(query_text)
print(len(docs))
lapis sequoia
#

this is also quite interesting lol:

Notice also that the neural network is only required to approximate within a compact set K. The proof does not describe how the function would be extrapolated outside of the region.

#

Well, i guess it does badly outside the region

#

i read it as: the approximation will be accurate within the training set (or the area sufficiently covered by it)

#

yeah, but it's limited unless you have all the data

#

so you may learn any fn fitting the data in a subarea

#

i don't mean infinite data, i mean all the data

#

uhmm..to me it's got a more practical reading as well. since we normally have a subset of the data, that may be fully linear (described by a line/plane/etc), and you may find f for that region K, in any-dimensional space

#

but that's f for K which is the universe for the model, since it hasn't seen outside K

#

no, the dataset (datapoints) would get g to approach f

#

so one works backwards and the dataset points define K for g which will approach to f for K

#

then any datapoint outside K will never be predicted correctly unless one gets lucky

#

that leaky relu and such can approximate any fn seems intuitive,
imagine that you put all weights to 0 apart from some (what ReLU does.),
and they add up to a local line that is 'tangent' to the real fn; for the offset, one's got the bias.

#

(the infinite neurons of the single layer are the infinite segments)

#

apparently this is why polynomials fail, i've no idea what it means

#

Paper is MULTILAYER FEEDFORWARD NETWORKS WITH A NON-POLYNOMIAL ACTIVATION FUNCTION CAN APPROXIMATE ANY FUNCTION sorry it's in caps

#

this is another "proof" of universality for maxout networks, it's visual. i can't get it very well, but seems intuitively right?

lapis sequoia
#

1992

spare forum
#

C(Rn) would be continous functions Rn->R

#

idk about Sigma_n

lapis sequoia
#

yup same, that's why i shared the paper's title below

#

but Density is mentioned in the wikipage as well

#

idk what that is either

lapis sequoia
spare forum
lapis sequoia
toxic mortar
#

Hey guys,

I’ve created a multiclass classification model and trained it on a labeled dataset. Went pretty well on the local dataset tbh and I’m now looking to soft-launch it into prod. The input data will be converted into an n-dimensional input vector, which won’t form a convex or regular shape when plotted on a chart (at least my EDA shows that). Since I can’t foresee every possible model input, the model won’t handle every scenario perfectly, which is i guess okay, but I am looking for broad use-case. Which will lead to a number of false positives, which I want to iteratively add to my training data corpus and improve the model overtime.

I’m looking for an efficient approach to identify and manage these false positives. I was thinking about:
1)Randomly sampling a subset of the data and label it manually to verify where it is true postiive or false postiive.
2)Get user feedback to identify misclassified ones.
3)Using clustering techniques with metrics like Silhouette score, Davies-Bouldin Index, Calinski-Harabasz Index (CH), Normalized Mutual Information (NMI), or the Dunn Index.
4) Combine 1) and 3)? Identify some of false positives and then with clustering to find the similiar ones which are possibly also false positives

My end goal is to create a pipeline that will iteratively improve over time. How would you approach this problem? Thanks!

gloomy pulsar
#

Happy Friday, August 16: It’s all about AI and Automation! 🎉
Hello i am new there,)

I need your collective wisdom for an AI challenge! 🧠

My mission: Describe images with AI, and I’ve set my sights on LLaVA.

The issue: I’m a bit lost on how to choose the best approach! 🏊‍♂️

Quick context:
• I previously used OpenRouter (which used Fireworks)
• But it’s no longer available 😢
• I’m looking fto use Python
• I struggled this morning with PyTorch (persistent DLL file issues) 😅
• My laptop doesn’t have a powerful graphics card

What I’m looking for:

  1. An API rather than a local solution (too complicated for me right now)
  2. Cost-effective options
  3. Technically simple solutions

I’ve already explored a few options:
• Replicate
• Hugging Face
Fal.ai
• Google Colab...

But I’m a bit confused by all these options and their differences... 🤔

Questions for you, wise developers:
• What would be the best API for using LLaVA in my case?
• How can I navigate through all the variations of LLaVA?
• Do you have a simple comparison of the models (efficiency/cost)?
• Are there other options I might have missed?

I don’t want to dive in headfirst without understanding all possibilities first. Basically, how would you go about researching and choosing the best option?

Thanks in advance for your insights! 🙏✨

Please excuse my English if it’s not perfect, as I’m not a native speaker.

bold snow
#

any idea how to get started with machine learning without heavy math background?

agile cobalt
lapis sequoia
#

Please excuse my English if it’s not perfect, as I’m not a native speaker.

are you sure you aren't native speaker

agile cobalt
bold snow
lapis sequoia
#

my take is that you would rather get started and build an intuition with a library that makes stuff for you

gloomy pulsar
lapis sequoia
#

bc that makes learning easier afterwards, in a way decoupling terminology from concepts.

lapis sequoia
#

like fast.ai

bold snow
#

got it thank you very much

lapis sequoia
#

you can also check pytorch slowly, it's great and has got many examples

bold snow
#

thank you very much

lapis sequoia
#

np, don't just listen my advice though, that's one angle, the book suggested to you is another, and the book is very good.

agile cobalt
lapis sequoia
#

i didn't understand much of those posts (a lot of jargon for me to parse.) but this is somewhat simpler, it's just a small piece of those posts

agile cobalt
#

• How can I navigate through all the variations of LLaVA?
Using a different model should be as simple as changing the name of the model in one line of code
If you mean finding all variations that exist, browse though Hugging Face models or the models page of the API provider you plan to use
• Do you have a simple comparison of the models (efficiency/cost)?
You can look up benchmarks, but you should never expect for the benchmark performance to be an extremely good estimation of its performance in real tasks ; You must test and benchmark it in your own tasks

lapis sequoia
#

The post starts

By the Stone-Weierstrass theorem,
and the image i linked is the theorem.. :-(

#

(from https://arxiv.org/pdf/1302.4389v4, neat paper imho)
The theorem says:

In mathematical analysis, the Weierstrass approximation theorem states that every continuous function defined on a closed interval [a, b] can be uniformly approximated as closely as desired by a polynomial function.
The paper/image as well, but replaces polynomial with PWL (piece wise linear.)

#

(bc maxout networks can approximate any PWL, they say they are universal fn approximators.)

cinder tangle
lapis sequoia
#

sota in CIFAR-100 not improved in several years apparently?

north drift
#

Hey guys!

#

quick question

#

Are you aware of any AutoML llibraries that takes advantage of CUDA or GPU?

#

I am working with AutoGluon at the moment, seems to be CPU intensive even though I am using GPU parameters. Is there anything that correctly integrates with Nvidia CUDA as per your knowledge?

If so, please drop a reply! Thanks!

wooden sail
#

did you install the gpu version and set the configuration to use the gpu?

#

most standard/popular ML libraries CAN use gpus, but require you to set it up correctly

north drift
#

yeah, I am using autogluon[all]

spare forum
#

I don't have a GPU but I know you can use it with autogluon

north drift
#

I see. It seems to be keen on using CPU but lemme look into it further

wooden sail
#

different models and optimizers require you to tell them explicitly to use the gpu

spare forum
#

It use CPU by default, also it uses all available cores with agluon

gloomy pulsar
# agile cobalt > • How can I navigate through all the variations of LLaVA? Using a different mo...

thank you very much @agile cobalt !🙏
For your valuable response and there is a lot of information and I really like your sentence that I put in quote above 'V yes because indeed we get lost in all the proposed models that is why I wondered how experienced developers did it which is not at all my case who tries it is just to run a simple script in python!

As I am a beginner basically I am dependent on the information that you give me the artificial intelligences to guide me and in some cases I wasted too much time on useless choices when there was a very simple solution in two clicks so it is true that it is never easy to find the right decision in my case to know which way to start but thanks to your information I am already better equipped

lapis sequoia
#

damn

#

the amount of optimizers in nevergrad is crazy

#

I am looping through all of them to see which one is the best

lapis sequoia
#

the results are in

#

I asked all of them to solve a maze

#

the worst one goes to "HaltonSearchPlusMiddlePoint" (which is quasi random search and idk what is middle point)

#

the best one is LargeDiagCMA (evolutionary strategy)

#

whats crazy is

#

Accelerated random search is better than all of them

#

and no library implements it

#

300 algos though thats insane

lapis sequoia
#

because I let them all have 10 seconds and nevergrad is a bit slower and does 10 times less iterations (even random search)

agile anvil
lapis sequoia
#

https://corbin-c.medium.com/
Check out the MLP article. Is it accurate enough??

agile anvil
# serene scaffold No

Ok, then please let me ask about classifiers in general for https://www.accenthelp.com/blogs/accenthelpblog/british-isles-accent-map

AccentHelp

British Isles Accent Map When people talk about a ‘British accent’, they tend to be thinking of the upper class Received Pronunciation accent. But what you might not realize is that the UK has a huge variety of accents, and a higher level of linguistic diversity than many other countries. These range from the lilt of t

serene scaffold
#

It would be especially helpful if the dataset had speakers who contributed audio samples in more than one accent

nocturne valley
simple tapir
#

Hi

#

in yolo

#
model = YOLO("yolov8n.yaml")
model = YOLO("yolov8n.pt")
model = YOLO("yolov8n.yaml").load("yolov8n.pt")

same as

model = YOLO("yolov8n.pt")

?

#

in the first example, we create a model from stratch and transfer the params of pretrained yolo model

#

In the second one, we directly use the pretrained model

#

Is there any difference between them or are they just same?

lapis sequoia
#

some ways u could find out: 1. log the model's weights + arch, 2. inspect the config file, 3. Try on a sample x and compare, 4. Read the docs. But as a guess i'd expect so. @simple tapir

#

the yaml file may be for re-training (or creating a model from scratch.), but can't say for sure.

lapis sequoia
#

Hi, I am looking for a Python-oriented AI notes PPT presentation , like python basics then numpy, pandas, matplotlib libraries.
Thanks

jaunty helm
unique spoke
#

Hey guys, Have a question on how I can run my program through the input from my phone's camera

#

But didnt seem to work for me

indigo wing
#

Hey can someone tell me all Major steps and their minor steps in order. Like what comes first and followed by what. WHere do we start fron? data gathering > wrangling or elt pipelines > preprocessing and what part of it etc. I am very confused about the process in bits. like transformation itself is part of preprocessing, but what others are part of it and at what time or project does it come?

#

for ai, ds and dl

lapis sequoia
#

The quadratic loss assigns more importance to outliers than to the true data due to its square nature, so alternatives like the Huber, Log-Cosh and SMAE losses are used when the data has many large outliers.

#

never used the Huber loss but it seems quite common

#

im looking for some good book on constructing/designing/handcrafting loss functions w examples

indigo wing
lapis sequoia
rigid timber
versed pilot
lapis sequoia
loud sluice
#

Hey, I have a AI+Cybersec Hackathon Problem Statement
I'd like if anyone could give their insights as to how they would approach this and how you would interpret this

1. Automated data collection from RAW images (forensic images) and other formats using disk imaging tools 
2. Automate the scanning and analysis of data, including files, system logs, registry entries, network activity etc. 
**3. Identify indicators of compromise (IOCs) and related suspicious activities 
4. Integrate AI/ML algorithms for anomaly detection and pattern recognition. The AI/ML feature should incorporate a scoring system and recommendation engine that allow investigators to quickly focus on the important artifacts. **
5. User-friendly review options should include interactive timelines and graphical summaries, while comprehensive reporting capabilities should allow exports in various formats such as PDF, JSON, and CSV.```

Emphasis on the 3rd and 4th point

Thanks
#

correct me if im wrong, ig we have to make an Anomaly detection like tool for Real time packets.

#

but the scoring system part is kinda confusing(pt4)

midnight moon
#

Hello Everyone, I am a student. I want to workout in a product in machine learning. I know programming language like c,c++,java,python. I have also been learning books from Oreilly publication and YT channel. How should I get started ?

placid gazelle
#

hi

runic parcel
#
Answer the question based on the above context: {question}"""

SYSTEM_PROMPT = """Based on the following context: {context}, please recommend the best tools for the question: {question}. Provide the tool names only in a Python list format."""```
is this good for my Ai RAG, anything to add or remove for making it a good prompt by prompt engg?
unkempt apex
warm river
#

ok, which part of math should i practice for a.i ?

#

I am interested in a.i

#

ok

rigid timber
#

Im quite new to this

unkempt apex
#

new on what?

#

web or AI?

hard steppe
#

str(helper_llm.invoke(f"write the very short summarize & combine of the DuckDuckGo Search Result's Without Loosing Detail, Result:\n\n\n\n{result}").content)
I am Using Langchain, THe Above is the Prompt, How Can I tell AI to not to include the Here is a short summary and combination of the DuckDuckGo search results without losing detail:
curent Result:

Here is a short summary and combination of the DuckDuckGo search results without losing detail:\n\n**Summary:** OpenAI\'s CEO Sam ... interviews, including of members of the OpenAI Board of Directors.

Result I want:

I can use any mode which is available on groq

rigid timber
unkempt apex
worldly dawn
#

@radiant shadow @left tartan here too ^

leaden kayak
#

What are your favorite prompt tips when using language and code models?

#

For me it’s « PEP8 style format » after a Python request

serene scaffold
leaden kayak
#

Getting more quality outputs from local models I use daily

#

It’s a question with broad applications

serene scaffold
spare forum
spring field
#

I'm just curious, is this what you're expected to do throughout the internship? (it just doesn't seem like you'd be doing much of "generative" AI)

serene scaffold
#

Looks like the internship is about building a data pipeline. It doesn't look like you'll be doing anything with any variety of AI. But your pipeline might support people who will.

serene grail
past meteor
#

In many jobs and roles (all the ones I've had) they were one person doing both

#

You can have a data engineer without a data scientist / ML engineer but not vice versa. If your future employer makes the mistake of hiring a ML person without a data engineer then you'll have to (be willing to) do both

serene grail
agile anvil
# versed pilot This is a bit of a huge task. Accent alone from dialect are two different things...

I disagree, the outlines around classification have been drawn decades ago, by those measuring the first and second formants of vowel shifts: https://www.cambridge.org/core/journals/journal-of-the-international-phonetic-association/article/abs/formant-frequencies-of-vowels-in-13-accents-of-the-british-isles/857541BE2E95A40117CBF24DE5836F6E

clear ore
#

HI, How are you , Can you please telll me i little bit confuse what i learn next i complete PYthon Bootcamp , Which field is best Data Science , Data Analyst , Cyber Security , or AI Enginer . My Self ....... My name is Danish , I do BS in Information Management from Punjab University Lahore. Thanks !

versed pilot
rigid timber
unkempt apex
river cape
#

Hi guys is it hard to build an image generation model without the use of any gpt?

buoyant vine
#

Hard

#

You need a lot of data

dense lichen
#

Hey guys

I just got an idea but don't know where to start from.
So we use a postgresql DB
I was wondering if someone could guide me on how I can like just give a chat prompt for people, and an LLM model could understand based on the schema and table descriptions that I am going to provide.

I want to train the model with the database schema and its descriptions.

What i want to do is help people not giving information to all the common platforms they use like chatgpt, claude etc, and just train my own model. this way the users dont have to keep explaining to the AI to get answers.
I am a professional, however im very new to all these so just wanted to know if this is something already done or any tutorials that could help me with it.

lapis sequoia
river cape
lapis sequoia
river cape
indigo wing
#

I ran a model 100 times in dev, I want to take a higher value from the highest value so that I can take care of the performance in prod. What is the appropriate percentage above the upper bound.

#

example tc 10-100 seconds in dev. There are also ram constraints as 100% ram being used

#

what's the % I should add above the upper bound that's not too extreme a case

#

10% seems too much if it takes model 10 s, and to much if it takes 1 week

#

I want my data to determine my extremes.

#

So that when I am working on my code in dev, and test it. It will throw error if it takes more than the acceptable range

lapis sequoia
#

it's quite weird that the loss fn can be considered as just a fn you want to minimise, vs having some relationship to statistics

#

i wish one day ill understand that :-(

#

seems how wikipedia starts the page about Loss

#

maybe learning how linear regression can be seen as an optimisation problem (least squares, by calculus or linear algebra approach.) or some statistics problem can help, idk.

indigo wing
compact valley
#

Trying to figure out which OS to use for data engineering before I jump into learning
I've been a web dev for few years now and will transition to data by new year

#

I am comfy using any and all win/linux/macos

#

I just wanna know like what is the preference in workstation/laptop setups real data engies use

river cape
river cape
compact valley
river cape
compact valley
river cape
#

One of the best tools

river cape
compact valley
#

I really really hope so, cuz I have no experience with data engineering tools and hoping that there will be no issues

#

like compatability and stuff idk, just wanna take the job with me to cafe if I want to
and MacBooks are awesome for that

lapis sequoia
#

Empirical risk minimization is a principle in statistical learning theory which defines a family of learning algorithms based on evaluating performance over a known and fixed dataset. The core idea is based on an application of the law of large numbers; more specifically, we cannot know exactly how well a predictive algorithm will work in practi...

compact valley
spare forum
compact valley
lapis sequoia
spare forum
#

Most company forces you the OS anyway, It's just a pain for spark and things like that, it's fine but not the best

compact valley
lapis sequoia
#

Background? i.e 2nd section

compact valley
lapis sequoia
#

ive got a mac mini w silicon chip, they are quite nice and cheap from what i see (2nd hand)

spare forum
#

Sound good

lapis sequoia
#

if one assumes P(x,y) -or joint probability distribution- exists it seems that P(y|x) (or conditional distribution) makes sense for DL. That should be what the model ends up estimating.

#

P(y|x) is just a slice of P(x,y)

#

i don't think the pixels are independent variables though..

radiant rock
#

hey guys does anyone know any ai tools that automatically create flashcards for study? i like quizlet but it doesn't include everything

spare forum
lapis sequoia
#

im trying to map this fn to DL as well (the Risk), i'd asy the integral is normally a sum, L the loss but can't see dP(x,y) mapping to anything

#

if one has \int sin(x)dx then maps to \sum sin(x) delta

spare forum
#

dP(x y) could mean it's for classification or regression, it's a measure, dw it's just a very general way of writing

spare forum
lapis sequoia
lapis sequoia
leaden kayak
spare forum
fiery bane
#

just get a pretained model, done.

lapis sequoia
#

lol

spare forum
fiery bane
#

now if he ask "but it has to be good" I'll answer: if you have few million dollars, you can do it by funding me and I'll do it for you

lapis sequoia
#

he meant general pretrained model :-)

fiery bane
#

what's a general pretained model?

lapis sequoia
#

any pretrained model, it was just a bad joke

fiery bane
#

lol ok haha,

spare forum
lapis sequoia
#

isn't this what i meant

lapis sequoia
fiery bane
lapis sequoia
#

im trying to understand a neural network in statistical terms as opposed to an optimised function, or something close to that @fiery bane

#

this article is so far the simplest description ive found, https://en.wikipedia.org/wiki/Empirical_risk_minimization
though not quite complete.

Empirical risk minimization is a principle in statistical learning theory which defines a family of learning algorithms based on evaluating performance over a known and fixed dataset. The core idea is based on an application of the law of large numbers; more specifically, we cannot know exactly how well a predictive algorithm will work in practi...

spare forum
#

it applies to all supervised learning tbf

serene grail
# lapis sequoia seems how wikipedia starts the page about Loss

Isn't loss basically the difference between true predictions (true positives, true negatives) and false predictions (false positives, false negatives)?
So this is what you're trying to minimize, 0 loss -> you estimate lines up with true values 100%
(I'm a noob so take this with a mountain of salt)

fiery bane
lapis sequoia
#

but it's possible to see it statistically

lapis sequoia
#

if edward witten likes it, i wont understand it

#

but yeah, it includes a lot of fantastic stuff

fiery bane
#

lol ias people

lapis sequoia
#

that looks good, nice to see some physics formulas there

#

thank you both @fiery bane @spare forum

spare forum
#

It would be easier to find smthing like a master degree course or something

spare forum
#

Finding articles etc...

fiery bane
#

Well, maybe all he needed was that one article I posted lol

spare forum
#

Just saying, bc on those courses you have a bit of everything centralized with the most important, may be heavier maths tho

fiery bane
#

that's true.
I think the best combination is if there's a text book, and a course based on just that textbook

lapis sequoia
#

finally understood the formula (approx)

#

the risk minimisation formula is:

  1. weighing the error (loss) with the probability of that instance,
  2. adding all up (integral in terms of x and y vectors) aka risk or probability of error,
  3. and minimising it (wrt to the weights.)
#

(in practice, it ends up being the standard mean-loss minimisation by backpropagation.)

fiery bane
#

yea haha pretty much

#

have you read bishop plmr?

lapis sequoia
#

i dont know much and need to go in steps

#

never heard of this before https://en.wikipedia.org/wiki/Riemann–Stieltjes_integral but it was useful for it

In mathematics, the Riemann–Stieltjes integral is a generalization of the Riemann integral, named after Bernhard Riemann and Thomas Joannes Stieltjes. The definition of this integral was first published in 1894 by Stieltjes. It serves as an instructive and useful precursor of the Lebesgue integral, and an invaluable tool in unifying equivalent f...

fiery bane
lapis sequoia
#

actually looks fantastic, great plot quality

#

last few days i read 5 papers and saved about 50

#

XD im falling behind

fiery bane
lapis sequoia
#

no, i just want to understand

fiery bane
#

good luck!

lapis sequoia
fiery bane
#

I don't need luck.
I need miracles T__T

lapis sequoia
#

u theist?

fiery bane
#

yea sure

ionic valley
#

do AI/ML positions ask for LC during interviews?

unkempt apex
#

getting error while installing packages with pip

#

on aws ec2 instance

#
WARNING: pip is configured with locations that require TLS/SSL, however the ssl module in Python is not available.
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError("Can't connect to HTTPS URL because the SSL module is not available.")': /simple/pip/
WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError("Can't connect to HTTPS URL because the SSL module is not available.")': /simple/pip/```
#

this one

faint quail
#

how does pytorch, tensorflow and other neural network frameworks save the weights and biases so effeciently? I'm making my own module from sratch and I notice that my file sizes are in the gb's and all I did was copy the yolo v1 architecture

https://github.com/TheonlyIcebear/Neural-Net-Framework/blob/main/utils/network.py

GitHub

I custom library I made for training neural networks from scratch, using numpy and scipy - TheonlyIcebear/Neural-Net-Framework

#

is it just that the data is compressed using some algorithm?

agile anvil
# versed pilot So that limits the scope only to accent, not to dialect, and only to vowels

I'm not interested in classifying dialect, just accent, partly because I believe simply identifying the vowels' first and second formants will provide as much geolocation information as more detailed examination of speech. It still requires dictation with phonetic time points (e.g. a first pass with a STT service like AssemblyAI, a second and third pass for forced alignment of words and phonemes with PocketSphinx, and a fourth pass putting each voiced phoneme into a formants extractor.) That could build a classifier with enough training data.

Unfortunately, nobody seems to have labeled training data of just "people native to $CITY, UK saying things". Is it even possible that doesn't exist somewhere yet?

serene scaffold
#

just leaving this here.

faint quail
tidal bough
proper crag
#

to enhance model's training time efficiency .....is it enouf if i install eGPU only?

#

im using macbook

#

and im planning to deploy the model to docker then connect the model via an API to my EVE-NG

#

bczu i wan the model to analyse time series data from my lab which is residing inside EVE-NG
EVE-NG is virtual environment for computer networking

brave yew
#

can you guys tell me what your development environment is for working or fine tuning models? I as a student use a gaming laptop until i blew my gpu a while ago, it wasn't that strong (GTX 1650) but it got the work done, but now i only have integrated graphics to work with which is abhorrent, so... what are some places where i can migrate my project to, to get gpu access?

unkempt apex
#

But then working with it maybe you need to adapt for cloud gpu providers or maybe AWS things

#

I have tried both ( AWS and colab) , it depends on what you wanna do with and how big is your mode

proper crag
brave yew
unkempt apex
proper crag
#

also didnt wrry for the network it just my own smoll network lab project and im myself have been majoring in computer networking till degree

spare forum
unkempt apex
unkempt apex
brave yew
unkempt apex
brave yew
#

for finetuning models using pytorch will i require colab pro?

proper crag
unkempt apex
#

You can still fine-tune within time limits

proper crag
#

for model like SVM is it CPU or GPU focused?

lapis sequoia
spare forum
proper crag
#

oh

#

ok..bcuz it uses SVM

unkempt apex
proper crag
lapis sequoia
#

sneak peek for the curious

unkempt apex
#

Yeah it also depends on dataset

#

But I still use ryzen 3 3200g

proper crag
#

i mean asked you that time

unkempt apex
#

Same sir

#

But if dataset is on kaggle you ca directly use there notebooks

proper crag
#

i mean i wan to connect the model

#

to an application which is used for my virtual computre networking lab

unkempt apex
#

Wdym mean by connecting model?

You can host that model and I tegrate API's ( just like GPT)

Or maybe add the model on kab, and then use that with short python code

proper crag
#

EVE-NG computer networking virtual enviroment

proper crag
#

app

unkempt apex
#

Ahh, then I am not familiar with that

#

I guess you have to use API then by hosting your model

proper crag
#

an application, my computer networking lab is inside the app and i wan to connect the model to analyze traffic data of my lab

unkempt apex
#

Ha e you ever tried integrating API calls on app?
Any app

proper crag
#

i'll try to search

#

although ty

unkempt apex
#

But you have to host the model on webserver then

#

Wait lemme search for that then

serene grail
lapis sequoia
#

nice, if you do we can discuss it

#

at least the intro, idk how complex it gets later so i might not be able to discuss the rest XD

unkempt apex
brave yew
#

wait... you can't use terminal in colab? how do you import libraries?

unkempt apex
#

Just import it in code

brave yew
#

damn i am dumb

lapis sequoia
#

paid colab has terminal

lapis sequoia
#

It's somewhat reasonable that AI won't work out of distribution, but does it learn generalisable units that can be easily learn out of distribution? The answer is to some extent yes (fine tuning, and other approaches), and no (they can't solve ARC challenges.)

#

Why does this happen?

#

but most of those guys (see bengio and karpathy, now seem to disagree!)

serene grail
#

I don't know, I feel like this is the sort of question the leading experts are trying to solve and I don't have the knowledge
I mean, fundamentally you should be able to learn to generalize based on limited information because humans do it. That's the thing.
So maybe the approaches we are using are just not yet good enough, like we haven't discovered how to make machines that "learn concepts" in the way that allows for this sort of generalization

#

Some people would say "just throw more compute at it" but idk about that 🥴

lapis sequoia
#

Yes, I agree, it's quite puzzling

#

that paper says:

In this work, we will adopt a more unified approach that addresses these problems from within the framework of connectionism.
we'll see (the "problems" are of creating more abstract, symbolic units; and "connectionists" is just standard deep learning.)

#

visually, it looks like this (the last bit is similar to Marvin Minsky's diagrams.):

main sluice
#

Hi fellow data scientists

remote stream
#

Bois is anyone interested in helping me in a project

#

Abt voice keyword detection

#

It's for a competition. I need help in vc

serene grail
lapis sequoia
#

imho the notion of "object" isn't that difficult to learn, isn't SAM (Meta's Segment Anything Model.) excellent at that?

#

im not sure whether it knows an object from a part of it, though, but does not confuse them in a way..

serene grail
#

I don't know anything about SAM
But also "object" is kind of a really vague term

lapis sequoia
#

yeah, there are papers about what an object is...

serene grail
#

Like, anything is an object. You can say that any part of an object is an object, any property of an object is an object, any action is an object, etc.
If we're talking about a "mental object", which again, I'm not defining well so maybe what I'm saying doesn't make sense 🥴

lapis sequoia
#

you can directly use sam

serene grail
#

Oh nice, I'll look into it later

lapis sequoia
#

i love this one:

#

you hover, it selects the dog

serene grail
#

If it's computer vision, it's more about detecting objects right?
I think the paper talked more about the ability to reason about objects (from reading the beginning)

serene grail
lapis sequoia
#

yes, but for that you need segregation

#

one problem seems that networks have all the information merged

serene grail
lapis sequoia
#

exactly

#

and the reasoning is the composability

#

so i'd do: SAM => NN 1 => NN 2 say

#

NN 1 may not be necessary actually. that's only for CV

serene grail
lapis sequoia
#

in my mind binding problem == not having segregation

remote stream
#

Guys is there someone who's willing to collaborate with me on a keyword detection project I don't have much knowledge in that field any help is appreciated

lapis sequoia
#

read sects 1 & 2, will read 3 tomorrow likely

serene grail
#

Nice, I only read section 1 so far

lapis sequoia
#

ended up reading 3 as well, not all the details though, and will instead read 4 + 5 tom.

warm mortar
#

Any ESRGAN expert here??

serene scaffold
warm mortar
serene scaffold
#

by waiting for someone who thinks they know about ESRGAN to present themselves, you're creating extra steps for that person if they ever appear, and preventing other people from potentially helping.

polar zinc
#

Hi, does anyone know how I can plot a line for average increase over time using one axis with Matplotlib?
Usually I do it based on 2 axis but cannot get any methods to work using 1
example data = [10,20,12,14,12,9,15,18,12,10,15,14,17,10,20]
The other axis is the days. Example: ["10th July", "11th July", "12th July", "15th July", "16th July", "20th July"]

left plover
#

np.mean()

warm mortar
spring field
warm mortar
#

My google collab show this whenever I upscale the videos using ESRGAN

#

How to solve this issue??

spring field
#

this appears completely unrelated to ESRGAN apart from it being somewhat involved in the process as a whole unamusedowo

#

and there's not enough information to answer your question, you haven't defined a name bing_shrug
to fix this, you would define this name

#

if it works locally but not on google colab, consider it being an issue with environment compatibility, for example, you're using a newer or older version on google colab than locally

warm mortar
spring field
#

how did you get it "from someone" if they are unreachable?

spring field
warm mortar
warm mortar
spring field
#

I would assume the same thing, but it's just not defined in that code path

#

and you still haven't provided more information
well, I suppose you asked whether you should and I didn't respond to that...
anyway, paste your code here: https://paste.pythondiscord.com

spring field
#

!paste no, paste it here

arctic wedgeBOT
#
Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

warm mortar
spring field
#

send the link to the paste

warm mortar
warm mortar
spring field
#

can you point me to where in the code is pre_upscale defined?

warm mortar
#

Line number 56

spring field
#

no, where is it defined?

#

that's where it is referenced

warm mortar
spring field
#

which line of code is that?

#

do you have some understanding of how Python works? because I feel not and if that's the case, I would advise you to start from the beginning, you can check out the resources linked below to get started

#

!resources

arctic wedgeBOT
#
Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

warm mortar
warm mortar
spring field
# warm mortar

as I said, it's not defined there, it's only referenced there, you need to define it first

and since you don't quite seem to understand that, I feel as though you should pick up on the basics before attempting such endeavours, that'll make it easier for you down the road too

spring field
#

I'm afraid it won't be that simple, for one, I have no idea what that function is supposed to do, and two, it would take quite a bit of effort to define it (probably), so, again, I would suggest you start with the basics and slowly work your way up

warm mortar
spring field
unkempt apex
#

first time using U-Net model, ,, so this is after 20 epochs,
but the mask should only contain the lines for road

faint quail
#

nnuh uh

faint quail
#

if so it seems to be doing its job

unkempt apex
#

like this one, where it is only creating lines

spring field
#

what was the input for that?

faint quail
#

it seems like it is only detecting the lines over a certain thickness, so likely needs more training time / model capacity or its a dataset issue

#

idk tho im prolly wrong

unkempt apex
#

num_classes = 1
is it okay?
because I only want that white lines?

spring field
#

supervised or unsupervised?

unkempt apex
#

supervised

#

dataset is on kaggle also

spring field
# unkempt apex like this

if these are the masks then you probably don't want to have a grayscale mask as an output, you want to convert pixels above a threshold to pure white and below the threshold to pure black and perchance calculate the loss with that

unkempt apex
#

this is the code I used while loading dataset

#

so all masks are grayscale

#

so do you think this is bothering it?

spring field
#

I meant that you convert the output of the network to black and white, instead of having values in between

unkempt apex
#

or I should use mask images as it is?

spring field
unkempt apex
#

lemme see how it can be done

spring field
#

lemme know how it goes, I'm curious as well 😁

unkempt apex
#
def predict_single_image(img_path, model, transform, device, threshold = 150):
    image = Image.open(img_path).convert('RGB')
    image = transform(image).unsqueeze(0).to(device)
    
    with torch.no_grad():
        output = model(image)
        output = torch.sigmoid(output)
        output = output.squeeze().cpu().numpy()
    
    # now applying threshold
    binary_mask = (output> (threshold /  255.0)).astype(np.uint8) * 255
    return binary_mask
#

is it good?

#

nah, still not getting correct output

spring field
#

you can just use np.where

unkempt apex
#

where to use .where?

#

and why?

spring field
#
binary_mask = np.where(output > (threshold / 255.0), 255, 0)
unkempt apex
#

okay so no matter what I change the threshold, it still gives me this

#

tried 0.5 also

unkempt apex
#

no visible changes

unkempt apex
#

yeah

spring field
#

then it, uhh, doesn't make sense, if something were off with the threshold, you'd be getting either a completely white or a completely black image

#

(did you save the code?)

unkempt apex
#

yeah auto save on vs code

spring field
#

well, surely something's not running

#

did you rerun the code?

#

how are you displaying the image?

#

can you go into a debugger and look at it?

unkempt apex
#
import torch
import torch.nn as nn
from torchvision import transforms
from PIL import Image
import matplotlib.pyplot as plt
from model import UNet 
from customDataset import CustomDataset  


model_path = 'best_model.pth'
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

model = UNet(n_class=1) 
checkpoint = torch.load(model_path, map_location=device)
model.load_state_dict(checkpoint['model_state_dict'])
model.to(device)
model.eval()


transform = transforms.Compose([
    transforms.Resize((256, 256)),
    transforms.ToTensor(),
])

def predict_single_image(img_path, model, transform, device):
    image = Image.open(img_path).convert('RGB')
    image = transform(image).unsqueeze(0).to(device)
    
    with torch.no_grad():
        output = model(image)
        output = torch.sigmoid(output)
        output = output.squeeze().cpu().numpy()
    
    return output

test_image_path = 'random.jpg' 
predicted_mask = predict_single_image(test_image_path, model, transform, device)

# Visualize the result
original_image = Image.open(test_image_path)
plt.figure(figsize=(10, 5))
plt.subplot(1, 2, 1)
plt.imshow(original_image)
plt.title('Original Image')
plt.axis('off')

plt.subplot(1, 2, 2)
plt.imshow(predicted_mask, cmap='gray')
plt.title('Predicted Mask')
plt.axis('off')

plt.show()

#

this is the whole test.py if you want

#

also , why to use debugger?

unkempt apex
spring field
spring field
unkempt apex
#

wtff sorryt

#

it's late night here and I am still awake with half open eyes😂

#

need to sleep but after this testing

#

okay threshold is working

#

but not getting accurate

#

for example, setting 150, giving full black image

spring field
#

perchance need to lower it
but also, try training on that mask

#

use it for calculating the loss from the ground truth and such

unkempt apex
#

how?

#

by specifying mask_image with threshold?

spring field
#

mmm, I'm not sure, maybe what I'm thinking of is more suited for a metric insetad of a loss, I was thinking of essentially using log loss and comparing the masks you get from the model after applying this threshold to the ground truth image pithink

unkempt apex
#

is it problem in training?, because testing seems to be simplen now, ( just use random.jpg and generate mask according to model is being trained )

spring field
#

I mean, clearly the model has either not had enough training or the training was ineffective

unkempt apex
versed heron
#

hey guys, some friends and I are working on some hackathon projects this month in the DS/ML space. if anyones interested in joining in shoot me a message!

fallow tree
#

guys

#

hello where can i find some free open ai Api just for testing

serene grail
fallow tree
#

Another question please

#

is there any alternatives for Google colab pro ? free with more ram , cz i cant work under 12.7 Gb Ram

proper crag
#

is google collab free?

#

if i wan to use it to host my model

serene scaffold
serene scaffold
agile cobalt
#

Hugging Face Spaces is pretty generous for model deployment tbh

I don't think you're going to find >12GB RAM for free without strings anywhere though

devout fable
#

hey, I just joined. Can anyone suggest a library for transforming excel files into markdown which preserve as much as possible of the original formatting? I've done openpyxl -> pandas -> markdown, but you lose a lot of formatting there.

versed pilot
heavy lily
#

Hii

#

Can someon help me with something

#

I am getting my data like this

#

But i want it like this

devout fable
#

data2.head()

versed pilot
#

or even just data2 ?

main citrus
#

Works too

#

You should remove the print

cosmic willow
teal sapphire
cosmic willow
#

thx checking rn

teal sapphire
#

Give me 10 minute to write code for your evaluation function to make sure its correctly computing accuracy

cosmic willow
#

i can wait thx

#

also as i see that should be a speed up but it only avoids to run it twice doesnt really improve the fact that my score seems to cap at 60.7%

teal sapphire
#
    test_results = [(np.argmax(self.feedsforward(x)), np.argmax(y)) for (x, y) in test_data]
    return sum(int(x == y) for (x, y) in test_results) / len(test_data)  

(Evaluation function)

#

try @cosmic willow

small wedge
#

you don't need to convert them to int btw

#

since bool is a subclass of int

teal sapphire
#

you right

small wedge
#

not that you need to really optimize an evaluation function

teal sapphire
#

but yea

small wedge
#

ofc ofc, I just mean it's a tiny tiny part of your runtime if you're training a model

teal sapphire
#

Yes

#

optimizing stuff like

#

model artitecture is more important

small wedge
#

yeah

cosmic willow
#

how could i implement printing the train accurasy too? i feel like it may be learning that only.

small wedge
#

the function spartan gave you does calculate accuracy, just print it

cosmic willow
teal sapphire
#

u can add a method to calculate training accuracy

#

similar to evaluate

#

and you can update the learn method to print the accuracy of both trainings and test datasets

hard fern
#

finally got my first data science job!

teal sapphire
cosmic willow
#

the train and test accurasy seems to be the same but still it starts at like 60% goes to 67% and goes up and down there

solemn warren
cosmic willow
craggy agate
#

Is a 4080 Super good enough?

#

I know that it is capped at 16GB VRAM.

serene scaffold
craggy agate
agile cobalt
craggy agate
spare forum
serene scaffold
#

basically, any CUDA-enabled GPU is good enough for whatever you can fit on it. And if you're training/fine-tuning a model, you need to factor in the memory footprint of that as well.

If you're trying to fine-tune an interactive LLM that came out within the last year, gaming-tier GPUs might not be enough.

serene scaffold
#

calculate how much VRAM you would need to load the 8B param model, and then how much extra room you would need for fine-tuning.

#

if that can fit on a 4080, yay. if not, you might be able to quantize it.

spare forum
#

You can use some computational ressources from any cloud provider, don't need to have the latest GPU at home

serene scaffold
#

I've never done the math on that, but buying compute time on an enterprise GPU for the specific experiments that you want to do is probably going to be cheaper than buying a gaming GPU.

#

(I don't train models on my gaming computer because then I wouldn't be able to game while I wait for the model to train.)

spare forum
#

There is no way it's economical to buy entreprise GPU for this, it's more of a geeky satisfying thing

#

(which is fine)

agile cobalt
craggy agate
# agile cobalt use a cloud gaming service to free up your GPU /s

Can I combine a bunch of these with NVlink or SLI:
https://www.amazon.ca/dp/B09SJ2BZ85?ref_=cm_sw_r_cp_ud_dp_QGCCXX2FHAP0HQX8S34R

#

Actually, 2 of them would also do the job

#

24GB VRAM

serene scaffold
craggy agate
#

By tasks I mean models and Datasets

serene scaffold
#

having two 12GB GPUs is worse than having one 24GB GPU, because data will occasionally need to move from one device to the other.

craggy agate
#

Whereas a 4090 would be nearly 1k$ more.

serene scaffold
#

sure. I'm just letting you know that that's how it works.

craggy agate
serene scaffold
#

I'm not sure

craggy agate
#

Either this or 2 used 3090s.

#

Cause new ones are very expensive.

jaunty helm
#

for running the llm you need a lot less

craggy agate
#

does it help with VRAM limitations as well?

jaunty helm
craggy agate
jaunty helm
jaunty helm
# craggy agate I see thanks!

actually, I see people buying tesla P40s for inference, not sure how they are when it comes to training
they're pretty old, but has 24gb vram and definitely cheaper than a 4090

craggy agate
#

I found this for $200

#

It seems to have 24GB RAM

agile cobalt
#

4992 cores sounds pretty low?

craggy agate
#

Not bad

#

not good either

#

but not bad, especially for the price.

agile cobalt
#

4080 SUPER was 10240?
hmm not as low as I thought

craggy agate
#

but won't the gen of the CUDA cores also play a role?

#

or are they all the same?

jaunty helm
# craggy agate https://a.co/d/dBwfBMz

that's a k80, even older than a p40
nonetheless, these are all old cards, and thus support an old version of CUDA, so I'd check compatibility at least before purchasing

agile cobalt
jaunty helm
#

like it's kinda special hardware and it doesn't function as your normal gaming gpu for example

serene grail
craggy agate
craggy agate
jaunty helm
craggy agate
jaunty helm
lapis sequoia
#

Metaphors We Live By is a book by George Lakoff and Mark Johnson published in 1980. The book suggests metaphor is a tool that enables people to use what they know about their direct physical and social experiences to understand more abstract things like work, time, mental activity and feelings.

serene scaffold
lapis sequoia
#

just thought it was interesting

serene grail
#

Is this related to the paper about how NNs don't have separate mental representations that you linked before?

lapis sequoia
#

yeah, last part of the paper, they say that perceptions are the basis of concepts

#

this is less related, but really crazy https://en.wikipedia.org/wiki/Ideasthesia

Ideasthesia (alternative spelling ideaesthesia) is a neuropsychological phenomenon in which activations of concepts (inducers) evoke perception-like sensory experiences (concurrents). The name comes from the Ancient Greek ἰδέα (idéa) and αἴσθησις (aísthēsis), meaning 'sensing concepts' or 'sensing ideas'. The notion was introduced by neuroscient...

#

(that the trigger of some perceptions or the why is semantic, as its believed in the case of synesthesia.)

serene grail
# lapis sequoia yeah, last part of the paper, they say that perceptions are the basis of concept...

That's interesting, I wonder what a "perception" would be for an NN
I've heard some people say that one of the major things that prevent these models from being closer to human performance is that they don't "learn on the fly", so to speak. And humans do, you learn something from every perception
But I'm not sure how human brains do this, is this just because biological neurons are extremely different from artificial "neurons" or is there something else

oblique isle
#

guys what is the best environmnt to train and work on a chatbot ?

lapis sequoia
serene scaffold
lapis sequoia
#

that they don't "learn on the fly", so to speak.
i think some are trying related stuff to solve the arc problem (by chollet; he offers 1M prize.)

#

not sure though, i vaguely remember.

oblique isle
#

so clearly i need a cloud env or smtg like this

serene scaffold
#

Yes

oblique isle
#

what do u suggest

unkempt apex
#

okay so right one is predicted mask , but it's not that accurate

oblique isle
#

thanks

left plover
unkempt apex
left plover
#

It has better edge detection

unkempt apex
#

heh? bruhh I am using U-Net model to train the images, so why to put canny here ?

spring field
spring field
unkempt apex
lapis sequoia
#

Isn't this a proof that in NNs the inputs can be considered random variables ? https://en.wikipedia.org/wiki/Independent_and_identically_distributed_random_variables#In_machine_learning

In probability theory and statistics, a collection of random variables is independent and identically distributed if each random variable has the same probability distribution as the others and all are mutually independent. This property is usually abbreviated as i.i.d., iid, or IID. IID was first defined in statistics and finds application in d...

unkempt apex
unkempt apex
lapis sequoia
#

the random variable is the set of pixels (for images), each time for example.

spare forum
#

Pixels are not iid

#

What you send relates to ml with tabular data

lapis sequoia
#

each time you withdraw an image it comes from the same distribution

#

i.e the training set

#

so it is iid imho

serene grail
#

is every pixel fully independent?