#๐Ÿ”ฌโ”Šresearch

1 messages ยท Page 1 of 1 (latest)

lucid echo
#

Hey folks! Kicking this channel off with a "meta" message. ๐Ÿ™‚ If you come across any interesting papers / blogs, please share and discuss in the #1130529466230247444 forum channel. As this Discord grows, I'll be looking into setting up some events for things like paper reading groups, invited talks / AMAs with researchers (and practitioners), demos / workshops, etc. If you have any thoughts on what you'd like to see or you yourself would like to give a talk or something, let's discuss here!

#

Then, to kick off some discussion ... what's up next on your reading list? ๐Ÿ‘€ And how do you organize what you want to read next? ๐Ÿ˜ฑ

hazy helm
#

Hey everyone ๐Ÿ˜„
This paper has been in my pending list for some weeks, so will start with: https://arxiv.org/abs/2306.11644 (Textbooks are all you need)
Also a survey paper on LLM Evals (https://arxiv.org/abs//2307.03109)
I was working on Text 2 SQL in my previous company, so came across these two papers. I tried organizing my notes on Notion but it is not free now so using Notepad again ๐Ÿฅฒ Want to try a different notetaking App, any suggestions?

pearl violet
hazy helm
#

Yes, somehow it mentions that I have utilized my workspace completely and I need to upgrade my plan, previously I used Notion heavily for many of my writeups and sections.

pearl violet
hazy helm
#

Lol that was what I was thinking when we had this convo ๐Ÿคฃ

viscid sigil
#

My TBR is impossibly long and growing, but this week's reading list is..... at least more manageable ๐Ÿ˜‚

https://arxiv.org/abs/2307.10169 (Challenges and Applications of Large Language Models - I love a good survey paper!)
https://dl.icdst.org/pdfs/files/236e636d7629c1a53e6ed4cce1019b6e.pdf (an old paper but a good paper: revisiting "High-Dimensional Data Analysis:
The Curses and Blessings of Dimensionality")
https://arxiv.org/abs/2301.04856 (Multimodal Deep Learning - I passed this one by when it first came out because I was busy catching up on shorter papers, excited to dive into it now)

lucid echo
#

I've just added some tags to the #1130529466230247444 channel! You can use these when you share papers. Once we have lots of papers shared ๐Ÿคž they'll be helpful for sorting through and finding ones you're interested in reading and discussing! ๐Ÿ™‚

thin hinge
#

I will be contributing to the list, and will join the discussions from next week! ๐Ÿค˜๐Ÿค˜

lucid echo
#

Now you have to share a tabular paper ๐Ÿ˜†

dire phoenix
#

@lucid echo I did but it seems to be greyed out. Does it need some form of approval ?

lucid echo
#

it's more grayed out on my screen b/c i have it selected, just a discord UI thing

dire phoenix
#

Ok yeah some UI choices are confusing. I have done some editing. (I didnโ€™t know it was a discussion.)

lucid echo
#

BTW if you ever want to organize something like a paper replication workshop / event, LMK. I can help set it up!

limpid finch
#

I'll definitely be interested in paper reading group

#

I mean how are we to discuss the latest cutting edge research without discussing the papers coming up with said research?

#

also this is unrelated but I think you should make a thread for each competition instead of a full blown channel

#

since I doubt any one competition channel would have much activity once the competition ends

#

and it'd be a shame to delete the channel losing all the convo that happened there

#

but leaving them pile up would also make the server hard to navigate

dire phoenix
#

Is there a way to have default filters in papers ?

wintry stirrup
#

Hey guys! I am new to Machine Learning and to the whole data science world. I am a computer science student and I have just written my first article. It is about using the p-value and the R-squared methods for variable reduction. I would love any feedback for me to improve. Thanks a lot! Here is the link:
https://www.linkedin.com/posts/yaser-jafar-4b89b7276_the-r-squared-and-p-value-methods-for-model-activity-7099187273958588416-O7QI?utm_source=share&utm_medium=member_desktop

I am thrilled to announce my first article ever in the #DataScience world. In this article, I delve into the intricacies of two crucial statisticalโ€ฆ

pale pollen
# limpid finch but leaving them pile up would also make the server hard to navigate

Thanks for the channel feedback, we're definitely still experimenting to see what works. The plan for competitions right now is that closed competitions will get moved to a new category, then after a couple of months they will move to an archived category (where they will be read only). So they will be available to go back to, but should hopefully avoid clogging things up. If you have any other channel feedback, please feel free to add it to #1130786027699707915

limpid finch
#

aight ๐Ÿ‘

wide portal
#

Hello,

I'm currently working on a time series project, and I intend to employ the EMD+CNN technique for forecasting the output. Upon applying EMD to the training data, I obtained a total of 14 Intrinsic Mode Functions (IMFs). Consequently, I constructed my CNN neural network with dimensions (30100, 20, 14, 1), with 20 representing the window size. However, I encountered an issue when attempting to decompose the test data using EMD, as it produced only 11 IMFs. This inconsistency caused an error when trying to execute the CNN model.

I have two questions: Is there a method to enforce a consistent number of IMFs during the EMD decomposition process? If not, is there an automated way to select the most significant IMFs?

Please note that I am utilizing the EMD-signal library in Python.

Thank you.

crystal hound
#

๐ŸŒŸ Anyone interested in an exciting scholarship opportunity offered by the AI and Cyber Futures Institute?!

https://www.linkedin.com/posts/ganna-pogrebna-7a846493_phd-scholarships-phdscholarships-activity-7105334874907054080-zv9c?utm_source=share&utm_medium=member_desktop

๐Ÿ“ข Exciting Scholarship Opportunity for Aspiring Researchers! ๐ŸŽ“
We are thrilled to announce that AI and Cyber Futures Institute will offer #PhD #scholarshipsโ€ฆ

warm prawn
amber fog
fervent locust
#

To my eye it appears that there are 525 directories, meaning that many classes. Yet in the end you have only 411 classes. An educated guess is that space characters in directory names are causing this issue. For example, VIOLET BACKED STARLING, VIOLET CUCKOO, VIOLET GREEN SWALLOW and VIOLET TURACO seem to be converted into a single group named VIOLET, and the same is true for other names that have the same word in their names before the space character. Generally speaking, it is a terrible idea to use space characters in file or directory names in python scripts. So you may want to fix this first before determining how accurate your training actually is. I suggest you simply replace space characters in file/directory names with underscores. Beyond that, your approach seems OK. There are other small networks that could work similarly well in terms of accuracy but give smaller models, such as SqueezeNet and MobileNet.

normal isle
#

Hey people my team tries to make the tts model emphasis better, do any one has ever made or see any kind of research like that arround ?
if it is there we can take some help

#

as far what we tought is appliyng ssml tags but it is no use for programmatic tasks and the other way was appliyng the original pitch from the original data to tts output I'm open to discuss

dusky ember
#

I've been experimenting with compressing EfficientNet models https://www.cranberrygrape.com/machine learning/tinyml/bird-detection-tinyml/ I documented my approach on my site. My test images on my edge impulse project have been mostly successful (or close enough where I see the resemblance). Was able to take my model from 4,491,895 parameters and drop it to 196,533 while retaining some portion of the accuracy (95% - 82%). I further refined the outputs to just birdfeeder ones and later quantized it for a final model of 96x96 with 411 outputs and 190,770 parameters with 82% int8 quantized accuracy. Not sure if there are any flaws to my approach but left enough in my notebooks and page so folks can reproduce as they desire if there's interest.

Cranberry Grape | Cosmic Bee | Tim Lovett

Obsessively Shrinking a Transfer Based Model

sinful patio
sinful patio
lost hollow
ember orchid
junior canopy
#

Hi, I am new to research area. Need some mentorship to start in NLP. kindly let me know if anyone can guide. Thanks

dusty dawn
#

Hello everyone, i have a question and i would really appreciate your assistance.pika_wow
I have 2 networking and ip addresses data files with .RR format (ex: myipv6add.RR, myipv6add2.RR) and i want to extract into MySQL file .. how can i write a script in python to do that ? harold

cursive flicker
#

hi everyone , I have done my btech in Data Science but I am new in kaggle discord community @admin if there any suitable work I would be happy to work on it. Let me know if anyone can guide me its functionalities

wide portal
#

Hello, everone!
I need help.

wide portal
#

I want to make a new song in the same style from 100 ambient music. These are all made by several instruments but not from human voice.
How can I do this?

ember orchid
wide portal
ember orchid
#

Those interested in Reinforcement Learning can check our recent work out

http://arxiv.org/abs/2408.14195

TLDR: We analyse a clustered multi-armed bandit formulation, where the learning objective is to identify representative arms from each cluster, in a fixed confidence setting

vestal fable
#

Hi, everybody. I have a question.
I want to make a method to architecture the neural network for given real problem.
Is this possible?
So, I mean can we make the certain arhictecture of network based on neuro science?
Please help me overview of this and methods.
Where I can find the proper references?

grand shuttle
#

Hi. I discovered a way to build a logical digital mind. Looking for you folks who are interested in discussing the idea๐Ÿค

long ridge
#

Hi everyone , i recently submitted a paper which got accepted, The paper is published on arxiv : https://arxiv.org/abs/2410.13293 , do check it out and i would really appreciate everyone's feedback on the paper.

my linkedin : https://www.linkedin.com/in/prakhardixit250697/

lucid echo
#

congrats! that's awesome

stoic wraith
# long ridge Hi everyone , i recently submitted a paper which got accepted, The paper is publ...

Hi man! I just finished reading your research paper on SBI-RAG, and I'm really impressed with your work. The way you combined Schema-Based Instruction with the Retrieval-Augmented Generation framework using a large language model is incredibly innovative. I love how your approach helps students break down math word problems into clear, logical steps - it's such an important skill for them to develop. The results you achieved are remarkable, with your system outperforming other AI models in terms of reasoning quality.

I also found it fascinating how you used another AI as a judge to evaluate the responses, and it's great to see that your system consistently scored higher in clarity and logical flow. While there might be some room for improvement, like incorporating more diverse datasets and human evaluations, I think your research has the potential to make a real difference in how we teach math problem-solving.

Keep up the fantastic work man!

I'm new to research field, but i'm interested to do join on research team and learn. if i have opportunities to join any your termed research team, please let me know.

long ridge
#

Thanks for the comments @stoic wraith , i am glad you liked it . Right now we dont have opportunities but if any thing comes up will let you know

stoic wraith
#

Alright

stoic wraith
little jungle
#

anyone doing any research work on nlp especially zipf's and heap's law ?

long ridge
#

Is anyone doing research here on how to enhance math reasoning in LLMs?

glad sigil
#

anyone taking part in Meesho competition here?

pseudo lodge
#

๐’๐ญ๐ž๐ฉ๐ฉ๐ข๐ง๐  ๐ข๐ง๐ญ๐จ ๐ญ๐ก๐ž ๐‘๐ž๐ฌ๐ž๐š๐ซ๐œ๐ก ๐ƒ๐จ๐ฆ๐š๐ข๐ง ๐Ÿ๐จ๐ซ ๐ญ๐ก๐ž ๐…๐ข๐ซ๐ฌ๐ญ ๐“๐ข๐ฆ๐ž !

I still remember the day Krupali Donda ma'am introduced myโ€ฆ

iron loom
#

Hi!
I'm a senior data scientist from Armenia (31yo) with 4+ years of experience and two master's degrees (RIT-NY, USA; AUA-AM). Last year, I applied to several top PhD programs but was unfortunately rejectedโ€”likely due to not having any publications yet, which I'm eager to change.
I'm looking to collaborate with a researcher or professor on AI topics, especially:

  • Multimodal machine learning
  • Autonomous AI agents
  • Knowledge graphs
  • Memory-augmented architectures
  • Reinforcement learning
    I've worked in both industry and academia, but honestly feel most at home in academic environments, surrounded by like-minded people. Started my data science journey at 24 (after military service) and have been playing catch-up ever since. While I'm doing well career-wise, I have a strong drive to push further and contribute to meaningful research.
    I'm aiming to build up research experience and publish papers before reapplying to PhD programs in December 2026. It's tough to find people working in these areas in Armenia, so I'm reaching out here.โ€จ
    Would love to discuss potential projects or hear any advice on connecting with professors/researchers. I think about just reaching out to authors of the papers I am reading but not sure whether it is good idea.โ€จFeel free to DM
    Thanks for reading ๐Ÿ™‚
cloud jetty
#

hello guys
please i have machine test to take. thats job hunting. can someone help by doing it together through google meet?

long ridge
#

https://paperswithcode.com/paper/sbi-rag-enhancing-math-word-problem-solving: check out the work I recently did , the paper was accepted at NeurIPS 2024 in Vancouver, Canada! ๐Ÿ‡จ๐Ÿ‡ฆ๐ŸŽŠ This research is all about making math word problems more approachable for students using structured, schema-driven steps ๐Ÿ“š๐Ÿง ๐Ÿ’กโ€”like a step-by-step guide thatโ€™s just like how teachers walk kids through problems! ๐Ÿ‘ฉโ€๐Ÿซ๐Ÿ‘จโ€๐Ÿซ

Whatโ€™s even more exciting? This approach doesnโ€™t just help students; it also enhances reasoning in LLMs by triggering those crucial intermediate steps for better solution paths ๐Ÿงฉ๐Ÿค–. the paper was accepted at NeurIPS 2024 in Vancouver, Canada! ๐Ÿ‡จ๐Ÿ‡ฆ๐ŸŽŠ This research is all about making math word problems more approachable for students using structured, schema-driven steps ๐Ÿ“š๐Ÿง ๐Ÿ’กโ€”like a step-by-step guide thatโ€™s just like how teachers walk kids through problems! ๐Ÿ‘ฉโ€๐Ÿซ๐Ÿ‘จโ€๐Ÿซ

muted pond
#

PyGen: A Collaborative Human-AI Approach to Python Package Creation

PyGen is an open-source tool designed to automate the generation of Python packages from user-provided prompts. By leveraging advanced language models, PyGen streamlines the development process, producing packages complete with testing and documentation. This approach has been applied to create tools such as AutoML (automated machine learning), AutoVision (computer vision), AutoSpeech, and Quantum Error Correction utilities.

Key Contributions:

Automated Package Generation: PyGen simplifies the creation of Python packages by generating code, tests, and documentation based on user inputs.

Advanced Language Model Integration: Utilizes sophisticated language models to interpret prompts and produce relevant code structures.

Versatile Applications: Demonstrated effectiveness in developing diverse tools, including AutoML, AutoVision, AutoSpeech, and Quantum Error Correction utilities.

Resources:

Paper: https://www.arxiv.org/abs/2411.08932

GitHub Repository: https://github.com/GitsSaikat/PyGen

License: MIT License

We hope this tool proves useful for your projects. Feel free to explore the resources and share your feedback or questions.

GitHub

Generate Python Package with Simple Prompts. Contribute to GitsSaikat/PyGen development by creating an account on GitHub.

reef frigate
violet comet
#

Title: Seeking Turbulent Flame Image Dataset

Dear Kaggle Community and Esteemed Researchers,

I am currently engaged in a research project that involves analyzing turbulent flame dynamics, and access to a Turbulent Flame Image Dataset is crucial for my progress. Despite exploring various repositories and reaching out to the corresponding author of a related study, I have not received any responses or found a source for this dataset.

As someone passionate about advancing understanding in this field, I deeply admire the research efforts of those who have contributed to this domain. If anyone could guide me toward obtaining this dataset or provide access to similar resources, it would be invaluable to my work.

Your support could significantly contribute to my research, and I assure you that the dataset will be used with the utmost respect for its intended purpose, with proper acknowledgment given to its creators.

Thank you for considering my request. I look forward to any guidance or support you can provide.

Best regards,
Maimunul Karim Jisan

quiet gale
#

๐Ÿš€ Dive into the Future of AI with Marco O1!

Iโ€™ve just published a comprehensive deep dive into Alibabaโ€™s groundbreaking AI model, Marco O1, designed for open-ended reasoning. This article unpacks how Marco O1 is setting new standards for developers and innovators with its cutting-edge capabilities.

Whether youโ€™re an AI enthusiast, a developer, or just curious about where open-source tech is heading, this piece covers it all โ€“ from core functionalities to its game-changing applications in the real world.

๐Ÿ”— Check it out here: https://www.linkedin.com/pulse/marco-o1-alibabas-advanced-groundbreaking-ai-model-nalkheda-wala-uhkmf

๐Ÿ’ก Trust me, this is more than just an overview โ€“ itโ€™s a must-read deep dive for anyone passionate about the future of AI!

Feel free to share your thoughts โ€“ would love to know how you see this impacting the tech landscape! ๐Ÿš€

Explore Alibaba's Marco O1, a groundbreaking AI model for open-ended reasoning. This in-depth analysis covers its capabilities, impact on developers, and future

vestal fable
# quiet gale ๐Ÿš€ Dive into the Future of AI with Marco O1! Iโ€™ve just published a comprehensiv...

Good articles, recently, the performance of open source models increase and that makes us allow customization.
Now, I'm doing the project related to AI call agency using APIs and it fails to understand the business logie during interactions with clients.
MCTS and COT is good framework for reasoning, but in real time applications such as call agency, of course that is not complex work, but I want to know the possibilities how we can apply Marco O1 to call agency.
Thanks.

stoic wraith
#

Hello, I'm a competitive programmer with a passion for Machine Learning. Over the past 8 months, I've been diving into supervised and unsupervised learning, deep learning, and NLP. Though I'm new to research on AI but I bring dedication, a strong work ethic, and a mix of ML and programming skills. I'm excited to join a research team where I can contribute, learn and grow alongside experienced members.

fossil osprey
#

hello i dont know if i am in the correct server but i am looking for ai project that i can join possibly write a paper by the end for my thesis please contact me if you are interested

raw eagle
#

If anyone is publishing papers either in LLM's or Computer vision please do include me iam too much interested to work due to lack of network iam unable to do things that what I want to do
Please please do include me if anyone is publishing papers ๐Ÿ™๐Ÿ™๐Ÿ™๐Ÿ™

fossil osprey
left ocean
#

Hi everyone, Can anyone help me with a list about time series analysis models that can be used in demand and sales forecasting because I am build a research paper about it

reef frigate
#

And you can checkout timegpt

reef frigate
#

Hello everyone, Is there anyone doing research on ai based BCI or interested in that domain let me know thanks

charred tapir
#

Hi everyone ,is there anyone who is targetting for december acl

reef frigate
#

anyone know free peer reviewed publication journal in AI/ML/DS instead of arxiv?

vestal fable
#

Hi, everybody.
I'm looking for ar related to project where that can detect palne and argument the objects on that.
If there is anybody who knows, please tell me.
Thanks.

idle spire
#

Hello everyone,
is anybody here who is familiar with AI development without frameworks?
I was able to build an image classification AI with GPU - acceleration (CuPy), but struggling with multiple classification (more than 5 classifications).

In case someone is experienced in that field, I am looking forward for cooperation. It might be fun for more under the hood experience.
Thank You.

vestal fable
#

Hi, everybody.
We're building news analysis models and need to collect news data of 20 years.
Is there anybody who knows news data service well?
Please tell me.

outer nest
#

did anyone try creating a text translation model, like from english to some unique language!
Let's say we want to create a model that translate English to LangX.
Any ideas?

tropic maple
lusty knoll
#

From my experience it's actually more effective to research the underlying transformer architecture, then any model of interest.

At the end of the day all a model is, is a choice of attention layers, optimiser including any relevant hyperparameters and tokeniser

orchid gorge
#

Hello

#

I am here in Kaggle.

golden knot
#

I hope this is the right channel to post about the LLM Research Cohort we are going through in **Cohere For AI **Discord. Join in if y'll find this interesting -
https://x.com/cataluna84/status/1877689686639992872

From the BIRDS(Beginners in Research Driven Studies) organized by @akankshanc of @cohere Open Science Community, we're thrilled to announce our new LLM Cohort! ๐ŸŽ‰ ๐Ÿš€

This isn't just another learning program; it's a hands-on, collaborative research initiative designed to push the

floral crest
#

Hello Everyone!
My name is Rushikesh and I am working as a data scientist. I am deeply interested in contributing to research in the data science space and would be thrilled to collaborate on any ongoing research projects you guys may have.

I have expertise in the following relevant areas:

Training Neural Networks
RAG
Fine-tuning LLMs
Python (Flask, FastAPI)
Prompt engineering
NLP

I am looking forwared to networking and collaboration

haughty vapor
halcyon cliffBOT
#
masooma_63162 has been warned

Reason: Bad word usage

proper tide
halcyon cliffBOT
#
masooma_63162 has been warned

Reason: Bad word usage

#
masooma_63162 has been banned

Reason: Too many infractions

mystic pumice
#

Building Recommender systems with Gaussian Mixture Model (GMM) and KMeans

1- Data preparation
2- Standard Scaling
3- PCA
4- KMeans: Uses hard clustering (each point belongs to exactly one cluster). Computationally faster than GMM.
5- GMM: Uses soft clustering (each point has a probability of belonging to each cluster). More flexible but computationally heavier.

kaggle: https://www.kaggle.com/code/omidsakaki1370/gaussian-mixture-model-gmm-and-kmeans

muted pond
cedar geyser
#

greetings , i am looking forward to write some research papers if u have any tips or u can help me in anyway pls dm . i am willing to become co - author in research papers aswell

wide portal
#

Hello everyone! Iโ€™m planning to write an article about formal verification of machine learning models. If anyone knows related articles on this topic please share them with me.

trail warren
#

Hello guys, I'm working on a project about Artificial intelligence system to enhance MRS and detect brain pathologies and I'm looking for papers/books about this topic, do anyone know anything about that? Thank you!

tacit flame
# trail warren Hello guys, I'm working on a project about Artificial intelligence system to enh...

Hello! Hope things are fine there! If you go to Scopus( Elsevier), Science direct, Wylley, Nature, you will find the best papers about.

https://www.elsevier.com/connect/how-life-sciences-researchers-regard-and-use-ai

https://www.scopus.com/home.uri

www.elsevier.com

Elsevierโ€™s new report explores AI attitudes among researchers and clinicians; co-author Adrian Mulligan comments on its significance for corporate R&D.

tacit flame
tacit flame
trail warren
tacit flame
tacit flame
tacit flame
tacit flame
wide portal
regal swallow
#

Hello, I am looking for a research opportunity. Please ping me if anyone have it.

hoary lake
#

Hello guys i am looking to publish my first research paper , what should i research on any particular new topic in which there is not yet much research and is easy

limpid ocean
#

Hey, i need help

We have long conveyor like 5 kms long
In which there are idlers around 4 to 5k
We divided conveyor section with imaginary line let's say every 20m , in that there are around 10-12 idlers of same dimensions.

We have normal data that is after replacement of idlers, and have abnormal data that is before replacement , we have dataset in the form of real positive fft that is each row contains list with 5k integers

If i train 1dcnn based auto encoder or vae it works section wise, like I can see higher reconstruction error in abnormal data. But it is impossible to create model for every section it will be computationally very expensive. I want single model that will work entire conveyor, but when I combine all data and train then it won't generalise well.

Also I tried extracting statistical features like kurtosis , skewness etc and trained dense vae but no luck what can I do ?

Note: i can see abnormality in normal data too. Even after cleaning it becomes more sensitive to normal data as well tell me better approach if you have any experience related to similer problem

compact cosmos
#

I need a little help in my project anyone who is having good experience in image enhancing and preprocessing

fervent pendant
#

๐Ÿ“ข Besoin dโ€™un Endorsement ArXiv pour publier en cs.HC (Human-Computer Interaction)

Hey everyone,

I'm trying to submit a paper to arXiv under the cs.HC (Human-Computer Interaction) category, but I need an endorsement from someone who has published at least 3 papers in cs.HC, cs.AI, cs.LG, or a related category in the past 5 years.

๐Ÿ” Paper Title:
๐Ÿš€ HistoAgent: A Fine-Tuned RAG-based AI Assistant for Historical Knowledge

It introduces HistoAgent, an open-source AI assistant leveraging Retrieval-Augmented Generation (RAG) and fine-tuned with QLoRA for historical research. The system integrates DeepSeek-7B, Qdrant, LlamaIndex, and RLHF for improved contextual accuracy in history-related question answering.

๐Ÿ“Œ Why this request?
arXiv requires new authors in certain categories (like cs.HC) to be endorsed before submitting. My endorsement code is: Y4CJHX.

If anyone here meets the criteria and is willing to endorse me, Iโ€™d really appreciate it! ๐Ÿ™

I can provide more details if neededโ€”feel free to DM me. Thanks a lot! ๐Ÿ˜Š

ember orchid
reef frigate
#

I'm looking for novel research ideas in ai in healthcare with potential research gap. Can anyone help me to find out it?

reef frigate
quiet gale
#

๐Ÿš€ ๐—”๐—œ ๐—ถ๐˜€ ๐—˜๐˜…๐—ฝ๐—น๐—ผ๐—ฑ๐—ถ๐—ป๐—ดโ€ฆ ๐—•๐˜‚๐˜ ๐—”๐—ฟ๐—ฒ ๐—ช๐—ฒ ๐—™๐—ผ๐—ฟ๐—ด๐—ฒ๐˜๐˜๐—ถ๐—ป๐—ด ๐˜๐—ต๐—ฒ ๐—ฅ๐—ฒ๐—ฎ๐—น ๐—ง๐—ต๐—ฟ๐—ฒ๐—ฎ๐˜?

As an ๐—”๐—œ ๐—˜๐—ป๐˜๐—ต๐˜‚๐˜€๐—ถ๐—ฎ๐˜€๐˜, ๐—ฃ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜ ๐—˜๐—ป๐—ด๐—ถ๐—ป๐—ฒ๐—ฒ๐—ฟ, ๐—ฎ๐—ป๐—ฑ ๐—”๐—œ ๐—ฅ&๐——...

patent fossil
#

Hello everyone, I currently having some experiments that concerning biometric template protection. AFAIK, there are two schemes for biometric template protection which are cancelable biometrics and biometric cryptosystems. Here I am interested in the biometric cryptosystems, such as fuzzy commitment scheme (FCS), I wonder does anyone here have tried or have implemented those concept (FCS) in their projects? If so, i would like some discussions.

Thanks in advance!

lavish hare
#

Hi everyone !
i m an ai research enthusiast ; want to learn together and discuss research papers ?

brittle nova
#

Hello Everyone,

I am looking to collaborate on publishing a research paper with someone experienced.
I am a Developer with 10+ years of experience. ๐Ÿ‡บ๐Ÿ‡ธ
I am finishing my PhD next year.

https://www.linkedin.com/in/akshaymittal143/

lament saddle
#

Hey! Iโ€™m building an AI startup focused on LLM finetuning and RAG.
Looking for a partner with some experience in this area to join the journey.if ur intrested please dm me

unreal flower
#

Hello everyone from Nigeria

vital hornet
#

Anyone used NotebookLM how is it?

regal swallow
#

Hello, I am looking for a research opportunity. Please ping me if anyone have it.

olive vine
#

Hi guys, i was wondering if my research idea sounds feasible as a 16 year old junior in High School: What geometric properties of a machine learning loss landscape help explain why some networks generalize better than others? Does anyone have any tips on how I should begin

lusty knoll
#

Hi,

I am half way through my bachelor of data analytics degree presently, here in Australia.

Every time I go to find a new Kaggle dataset to practice on, I find myself considering many ideas and hypotheses, that are hard to test due to limitations with the data, I seem to find myself considering many connections across multiple domains such as economics, psychology, and the relevant subject matter.

I'm thinking, the most effective way for me to get comfortable in machine learning, is to work on some kind of research project. If anyone is working on anything, or would like to potentially collaborate to contribute insights, let me know.

I'm interested in pursuing a PhD at some point afterwards, but am leaning more towards finding work for a couple of years first after my current degree. Even if you think connecting with me might be worthwhile, reach out ๐Ÿ™‚

dusky moss
long ridge
#

Hi Everyone, I am a 2nd Year PhD student in Computer Science at University of Maryland Baltimore County specializing in Machine Learning, Reinforcement Learning, and Mathematical Reasoning in LLMs. I was thinking to write a Review paper on the current Maths Reasoning in LLMs , so was looking for potential collabrators on it. Thanks

spare atlas
sacred hound
#

What kind of software would you use for collecting ongoing self-report data from subjects, which would provide subjects with anonymity, but also keep their consecutive reports together under their subject id?

dusky moss
#

I am banned from the following tech discord channels: Microsoft, Google, Tesla, and the OpenAi discord channel.

The reason?

Because of this:

https://www.kaggle.com/code/alshival/is-tesla-s-fsd-missing-the-road-for-the-trees

Back in the day, people were defending Elon Musk and they pushed me out.
Nowadays, I think my ban wouldn't have happened because people finally got tired of him.

lusty knoll
#

Is there any research being done on using GAMs on feature importance to obtain the maximum importance per feature? (for example, x has a sensitivity rating, it could be the same, less, or more than the sensitivity of the sensitivity of (x) + (x^2)

This should help increase the ceiling of models accuracy scores, while providing insights for a better accuracy vs resource cost ratio as well ๐Ÿค”

Given features are being transformed not removed, permutation importance isn't needed, and GAMs already account for the small change that one feature transformation can have on the other features. To account for the cost of getting a GAM per feature, reduced samples should still yield adequate results

Edit: Okay yeah so I learnt about different GAM models that remove the need to transform features in such a way because they already learn the best representations, saves me a lot of time at least xD

I can use an EBM solely for the feature transformation process instead >:3

lusty knoll
# lusty knoll Is there any research being done on using GAMs on feature importance to obtain t...

On this, I've decided to include EBMs solely for finding the best representation of features with respect to the target variable. This is similar to DecisionTreeEncoder, except it treats all variables as dependent, as in it's multivariate.

My priority is on my framework entirely, but if someone would like to actually test this, even formalize the results if you wish, let me know, I'd be interested ๐Ÿ™‚ this means that only one EBM is needed, rather than a GAM for every variable XD

Edit: So by transforming the features with respect to an EBM, it's basically the same as producing the artifact of the EBM itself, so it's a redundant process in the end.

(For those who use EBMS, I strongly suggest clustering + aggregating + encoding data to reduce the samplesize drastically, this is not gpu-accelerated). If I had clients paying for my services, I wouldn't even consider using XGBoost with this beast available

lusty knoll
#

Research idea for anyone who wants to try it:

Try featureengine' DecisionTreeEncoder, but modify it to use LASSO (and/or elastic net), to account for undersampled classes (be particularly careful for data leakage, if you haven't used DecisionTreeEncoder before).

Let me know the results if you decide to give it a go ๐Ÿ™‚

tropic rose
#

Hey everyone!
I'm a PhD student, and together with my research group from Politecnico di Milano (Italy), we're studying how experts share ML models and which aspects they care most about during the sharing process.
As part of our research, we're collecting quick feedback through a short Google Form โ€” just a few questions, and it should only take 1โ€“2 minutes to complete.

Weโ€™d really appreciate your help!

Hereโ€™s the link: https://forms.gle/GUudDztkFsnqE3Qc8

If you have any questions, feel free to reach out to us at gabriele.digregorio@polimi.it or/and marco.digennaro@polimi.it. Thank you!

noble jungle
#

๐Ÿ“ฃ Call for Papers: 5th Muslims in ML Workshop @ NeurIPS 2025 ๐ŸŒ๐Ÿค–
Weโ€™re excited to announce the 5th Muslims in Machine Learning (MusIML) Workshop, co-located with NeurIPS 2025! This year, weโ€™ll be gathering on Tuesday, December 2nd, 2025, at the San Diego Convention Center, USA. Join us for a day of vibrant discussion, research, and community at the intersection of Machine Learning and Muslim communities.
.
๐Ÿง  Who should submit?
This is an open call for anyone working on ML that aligns with the goals of the workshop. We especially encourage submissions from researchers who self-identify as Muslim, and those engaging in community-focused or ethics-aware research.
๐Ÿ’ฌ Topics of Interest Include:
ML for social good (education, health, governance)
Language and speech tech for Arabic, Urdu, Persian, etc.
Digital analysis of Islamic texts (e.g., Quran, Hadith)
Bias mitigation, fairness, and responsible AI
LLMs, generative models, federated learning
Robotics, NeuroAI, and ML infrastructure

Important Dates
Visa-friendly submission deadline: August 22, 2025 (abstract, August 15, 2025)
Visa-friendly notification of decision: August 31, 2025
Regular submission deadline: September 15, 2025
Regular notification of decision: September 30, 2025
Camera Ready submission: October 14, 2025

For more details
Website: https://www.musiml.org
NeurIPS CFP: https://www.musiml.org/events/2025-NeurIPS/cfp.html
Facebook Page: https://www.facebook.com/musiml.org/
.
๐Ÿ—“๏ธ Workshop Date: December 2, 2025
๐Ÿ“ Location: NeurIPS 2025, San Diego, California

inner canyon
#

Hey, if you are a builder/creator can you please spare 10 mins of your time to answer this survey: https://forms.gle/D1CXQV5Y8fwPzYibA This would be a great help ! Please do send this forward to your creator/builder friends.. Thank You ๐Ÿ™‚

lusty knoll
#

A small-scale research project if anyone has the time would entail a new way to calculate feature importance for tree-based models.

That is, similar to how feature importance is calculated in the scikit-learn module, except you account for the second best choice across each split, getting the difference in variance reduction.

It's naive, but less so than the current way it's calculated. Naive, because with a different split, the downstream splits would also be impacted. The results should be compared to the traditional feature importance value, and literal feature importance values calculated from one-off recalculations of the tree absent of the control variable, and comparing the difference.

There's a much more complex project that would rely on the results of the above, so if anyone decides to try the above, let me know ๐Ÿ™‚ otherwise, I'll get around to it at some point

lusty knoll
#

Right, I'm working on the above now. Will post a link with my findings when I am done.

Another idea is having decision trees be dynamic, instead of controlling overfitting by specifying the max number of leaf nodes, samples etc, you could form the tree with an internal holdout set (not the true holdout set), and when a split occurs where the distribution of samples is unusually higher in one child node than the other, you then use that holdout set to see if it's legit, or if it's fitting noise.

Might work on this after the above. (This could also mean a new optimization approach where you actually want more of these high-stakes splits, and could yield more powerful trees of a lesser size)

lusty knoll
# lusty knoll A small-scale research project if anyone has the time would entail a new way to ...

Interesting results from this so far..

Another avenue of research I will investigate is how it compares to LASSO and Ridge for managing complexity, by adjusting the split to use (Ginior variance reduction) * lambda * (my normalized metric).

The metric effectively asks two questions: "How necessary is this feature in constructing the tree", and "How uncertain is the model for this split?" ๐Ÿ™‚

lusty knoll
#

Still need to add it to PyPI, but here's a unique partitioning approach with its own custom metric to measure the quality of partitions by way of the average HHI with respect to the variance proportions per leaf-node.

It turns the decision tree into an unsupervised global variance reduction model by getting the sum of the z-scores of continuous values, and trying to minimize it (while original columns are left untouched).

I tested it on 1 million samples, 5 features down to 20,000 buckets. 0.0015 HHI, only took 7 seconds to process. Enjoy ๐Ÿ™‚

https://github.com/HotProtato/H-VRT

frail ruin
lusty knoll
# frail ruin if you reduce global variance then almost all leaf have similar values. does it ...

All leaf nodes make up a tiny portion of the global variance for every feature. It's not for predicting, but partitioning.

Personally I'll be expanding upon it in another framework that will both allow for inference, but will still be designed to be used with a downstream model

Also by global variance I mean the goal is to reduce variance across all features instead of reducing the variance of a single y value; global variance with respect to y.

The global variance of the features is the same, but is represented by leaf nodes as small partitions

loud rose
#

Hi, @everybody
I have one question, I'm training ml models for the prediction, which is classification problem of 3 classes, where the number of samples are similar but the predition is skewed.
First class and second class is predicted with low precision tough, third class is never predicted. What's the reason? I can' t find the reason.
Before, when I applyed reinforcement learning, where the three classes were assigned to three actions and one action is never selected, too.
Actually, that is the preeiction model of forex eur/usd.

lusty knoll
# lusty knoll Still need to add it to PyPI, but here's a unique partitioning approach with its...

If anyone wants a potential paper to do, you could use this framework in 2 ways (for now):

  1. As an imputer, (there's a few challenges to this, but could be very worthwhile).
  2. As a complexity measure. The required hyperparameters with the HHI metric could measure a dataset's complexity in terms of how multivariate it is. Rigorous testing will be needed to establish baselines.

Having to continue with university for now, so I'll have my hands full. If you have any questions or an interest in that, let me know ๐Ÿ‘ผ

A similar point to #2, which might be a good comparison or cheaper global measure, is using HPO + Lasso or Ridge with a tree model, the higher the coefficient, the more complex the data

Edit: (I don't know why I keep thinking of these ideas, my brain doesn't let me rest T_T)
3. You could make a greedy selection problem, where you select leaf nodes and their neighbours such as that the HHI is reduced as a clustering technique (which can be supervised or unsupervised)

lusty knoll
lusty knoll
# loud rose Hi, @everybody I have one question, I'm training ml models for the prediction, w...

Just on this; I tested my H-VRT framework for its potential as a anomaly detection model. So far precision is slightly less than isolation forest, but the recall is significantly greater than isolation forest, thereby with Bayesian inference, yields more favourable results.

You could look at adapting that framework to your needs if you like; it seems to be natively robust against underrepresented classes.

I'm focusing on university for now, but when I get the time I have a strategy to place the variance reduction and HHI (Gini impurity) on the same scale, and to use this for confounder analysis.

In fact, by using my normalized importance metric on both from earlier, I expect to use it to solve the n! problem in casual inference, by only focusing on the relationships that are important. It'll be trivial to actually store all n! information, but it's about extracting the important information that will be more annoying to develop xD

Once I adapt the normalized metric with the categorical and continuous values being on the same scale (with the same target encoding functionality to manage ordinal and nominal categories in the same way), due to the multiple y-target with my split criterion, many OLS models through vectorization can be fitted on the continuous values in the leaf nodes.

The result? You can see through a kernel density plot, the reconstruction of every feature given every other feature, effectively within a system, you get an aggregated view on how features are impacted by other features including categorical.

To get the actual extent of how much features impact other features, my normalized importance metric asks "How important was this feature in constructing this tree?", most importance metrics asks "Given this fitted model, how much does this matter for inference accuracy?"

#

Sorry I get kinda hyped up about all this stuff xD if any of you want more info, or would even like to help, feel free to msg ๐Ÿ˜

loud rose
#

I know causal discovery and inference and am interested in you, and Iโ€™d love to talk with you about it in detail. Iโ€™m a bit busy right now, but hopefully we can chat about it next weekend.

loud rose
#

Hi, @everyone
Is there anyone who joins radical ai founders' masterclass?
I didn't have an opportunity to apply for that.
Please give me the meeting urls for them.

lusty knoll
hallow lodge
#

Hello Guys

I'm planning to develop a completely open-source LLM,which is free to use(and free to develop,since the costly prices of GPU's).Currently, I need the following things:

  1. Board Members to work with my level of humor,and comfortable with my constraints and my age.
  2. Independent Suppliers(Cloud Computing, Data storage etc).
  3. Advisors

I am currently experimenting on 2x T4 GPU's in kaggle.
Here is my discovery lately:

  1. tiktokenizer works the best, every attempt of scratch-bpe has failed.
  2. Need to use AdamW8Bit from bitsandbytes to optimize.
  3. We need a HUGE data.

Kaggle: afifalisaadman
HF: Afifsudoers

Contact me for further inquiries( Development of the Project will begin in 9th December respectively)

golden gull
#

heyy

river canyon
#

๐Ÿ† Attention Kaggle Researchers and AI Builders!

If youโ€™re passionate about uncertainty calibration, AI safety, and building models that know when they donโ€™t know โ€” the Aletheion Research Collective invites you to join our open community.


๐Ÿค– What is Aletheion?

Aletheion is a research project exploring epistemically safe AI โ€” architectures that embed humility directly into intelligence.
Itโ€™s inspired by the paper How to Solve Skynet: A Pyramidal Law for Epistemic Equilibrium, which introduces:

  • Epistemic Softmax โ€” a calibrated replacement for softmax.
  • Q1/Q2 uncertainty gates โ€” distinguishing between aleatoric and epistemic uncertainty.
  • Fractal training (VARO) โ€” enabling multi-scale reasoning about confidence.

Our goal: build models that are not just accurate, but self-aware about uncertainty.


๐Ÿงฉ Why Join Us?

  • ๐Ÿง  Collaborate with AI researchers, data scientists, and philosophers worldwide.
  • ๐Ÿงช Share and reproduce experiments on calibration, robustness, and alignment.
  • ๐Ÿ’ก Explore open-source code implementing Epistemic Transformers.
  • ๐Ÿ“Š Turn Kaggle insights into epistemic benchmarks for AI trustworthiness.

๐Ÿ”— Join the Movement

๐Ÿ’ป GitHub: github.com/AletheionAGI


๐Ÿ•Š๏ธ Final Note

"The next frontier in AI isnโ€™t bigger models โ€” itโ€™s models that understand their own limits."
Join Aletheion โ€” and help design intelligence that knows when it doesnโ€™t know.

stray wasp
#

Hi

kind dawn
#

Research Collaboration Opportunity โ€“ Clinical Proteomics ร— LLM Reliability

We are conducting a study titled โ€œHallucination Risks of Large Language Models in Clinical Proteomics.โ€
The project systematically evaluates how models such as GPT-4, Claude, and Gemini perform when interpreting clinical proteomics data, focusing on hallucination frequency, error patterns, and reliability assessment.

Our results indicate that even frontier models generate 27โ€“35% factual errors, rising to over 50% for complex or rare-protein queries.
These findings highlight the significant reliability and safety challenges of applying LLMs in biomedical contexts.

We are seeking a collaborator who:
โ€ข Has experience working with proteomics or mass-spectrometry datasets
โ€ข Understands LLM architectures, evaluation frameworks, or AI safety

If this aligns with your expertise or interests, feel free to contact me or reply here.
Code, datasets, and the full evaluation pipeline are available on GitHub:
https://github.com/olaflaitinen/llm-proteomics-hallucination

Tags: #AIResearch #Bioinformatics #LLM #ClinicalData #ResearchCollab

river canyon
grizzled cove
#

I have built an iOS/Android app that allows a user to manually predict time series data thatโ€™s used to train an ai/ml model. I want to run a kaggle competition that would involve users comparing their trained model to their manual predictions.

river canyon
#

๐Ÿš€ After 6 months of building, I'm excited to launch AletheionGuard

The problem we're solving:

Companies are deploying AI (chatbots, RAG apps, agents) in production without knowing when their models are generating incorrect information.

This is especially critical in:
๐Ÿฅ Healthcare - Wrong medical advice
๐Ÿ’ฐ Finance - Incorrect market analysis
โš–๏ธ Legal - Unsupported claims
๐Ÿค Customer Support - Wrong product information

Our solution:

An API that quantifies epistemic uncertainty in LLM responses. In simple terms: we tell you when your AI is making things up.

How it works:

  1. Your app gets a response from an LLM
  2. Send prompt + response to our API
  3. Get back confidence scores and recommendations
  4. Decide whether to show, flag, or reject the output

Real impact:

  • One healthcare client reduced incorrect answers from 23% to 4%
  • A legal tech company now catches 85% of unsupported claims
  • A customer support bot knows when to escalate to humans

We're offering a free tier (1,000 requests/month) so teams can test it risk-free.

If you're deploying AI in production and care about reliability, I'd love to hear your thoughts.

Try it: https://aletheionguard.com

What challenges are you facing with AI accuracy in your organization?

hashtag#AI hashtag#Enterprise hashtag#Technology hashtag#Innovation hashtag#Startup

little jungle
cloud night
#

Is anyone interested in working on a research project or do you an idea you would like us to work on?
Kindly dm for more discussion.

halcyon ermine
#

Iโ€™m planning a comprehensive SHAP analysis and explainability on this xLSTM(based on only mLSTM) model: https://huggingface.co/stefan-it/xlstm-german-wikipedia
Main goals:
โ€ข Understand how the model makes predictions through feature attributions
โ€ข Explore how the mLSTM memory mechanism works under the hood
โ€ข Visualize what the model โ€œpays attention toโ€ when processing text
Any advice on the best approach to tackle this? Would appreciate suggestions on tools, methods, or workflows that work well for this kind of analysis.
Thanks!

lunar basalt
#

Hey everyone! ๐Ÿ‘‹
Iโ€™m conducting a short academic survey for my Research Methodology internal assessment on โ€œThe Impact of ChatGPT in Education.โ€
It takes less than 3 minutes to complete and all responses will remain anonymous.
Your input will really help me with my project โ€” please fill it out below ๐Ÿ‘‡

๐Ÿ”—Survey Link

Thanks a lot for your time and support! ๐Ÿ™

weary jackal
#

Hi @everyone ๐Ÿ‘‹

For the last 4 months, Iโ€™ve been heads-down building QuintNetโ€”my own distributed training library built from scratch in PyTorch.

My goal was simple (and naive): Stop treating DDP and Megatron-LM as black boxes and actually understand the physics of distributed training.
It implements full 3D Parallelism (Data + Tensor + Pipeline) on a custom GPU mesh.

The "Fun" Part (aka The Struggle):
๐Ÿ› Debugging silent NCCL hangs that gave zero error messages.
๐Ÿ“ Chasing tensor shape mismatches across ranks in the pipeline.
๐Ÿ•’ Realizing that implementing 1F1B scheduling correctly is... harder than the papers say.

I finally got it to converge on a custom ViT across an 8-GPU mesh without deadlocking. ๐Ÿš€

What's inside:
A custom DeviceMesh implementation.
Manual P2P communication handling for pipeline stages.
Custom Column/RowParallelLinear layers.

I wrote a detailed deep-dive on the architecture (with diagrams of the communication flow) and the code is open source. Iโ€™m now working on adding ZeRO-style optimizer sharding and would love any pointers or feedback from the systems folks here!
Do star my repo if you find work to be meaningful.
Links:
๐Ÿ“– Blog (Visual Guide): https://medium.com/@shuklashashankshekhar863/quintnet-a-3d-distributed-training-library-db0181a33a80
๐Ÿ‘จโ€๐Ÿ’ป Repo: https://github.com/Wodlfvllf/QuintNet

My Linkedin Profile - https://www.linkedin.com/in/shashank-shekhar-shukla-722859227/
Thanks!

untold leaf
weary jackal
# untold leaf Which papers are you referencing? They seem weak sauce

Sorry forgot to add references. Been busy these days. By the way Megatron LM papers for model, tensor and pipeline parallelism, hugging face nanotron The ultrascale playbook. There youtube videos. Some more papers and there source codes references.

Would add it in some time. Additionally this can be a weak project as build just for fun and learning. Nothing more than that.

weary jackal
untold leaf
lavish hare
little jungle
still sail
#

Hey everyone ๐Ÿ‘‹
Iโ€™m Vimarsh, an MSc graduate currently working as an AI Engineer.

Iโ€™m actively looking to collaborate on ongoing research projects (AI/ML, applied research, or interdisciplinary work) as Iโ€™m building a stronger research background alongside industry work.

If anyone is open to collaboration or needs help with experiments, modeling, data, or writingโ€”happy to connect and contribute. Feel free to DM me!

Thanks ๐Ÿ™Œ

odd moth
#

Hi everyone! Iโ€™m doing research on deep learning optimization and wanted to share an experimental
second-order optimizer Iโ€™ve been working on (FROG). It uses row-wise Fisher preconditioning with batched Conjugate Gradient and aims to
improve time-to-accuracy with low overhead.

I wrote a short technical overview here:
https://github.com/Fullfix/frog-optimizer/blob/main/technical_overview.pdf

Code and CIFAR-10 experiments are in the same repo. Feedback very welcome ๐Ÿ˜ƒ .
Thanks for reading!

little jungle
whole palm
#

Hi @everyone
๐Ÿ“˜ Python Loops & Strings โ€“ Kaggle Notebook ๐Ÿ
This notebook explains Python loops (for, while) and strings in a detailed and easy-to-understand way, with clear examples.
Itโ€™s especially helpful for beginners ๐Ÿš€

Please check it out and leave a vote โญ and a comment ๐Ÿ’ฌ โ€” your feedback is highly appreciated! ๐Ÿ™Œ
https://www.kaggle.com/code/dastgeerjutt/3-loops-and-strings-detailed

little jungle
rose sequoia
#

Hi! I am trying to upload on arxiv, but I need my account to be endorsed first. If you have uploaded at least 3 AI-related papers there within the past 5 years, may I ask for your help? Thank you in advance!

weary jackal
# weary jackal Hi @everyone ๐Ÿ‘‹ For the last 4 months, Iโ€™ve been heads-down building QuintNetโ€”m...

Hi @everyone
Update on QuintNet: from ViTs โ†’ training a small GPT-2

A few months ago I shared QuintNet โ€” a PyTorch framework I built from scratch to understand 3D parallelism (Data + Tensor + Pipeline). At the time, it was validated on a custom ViT running on a 2ร—2ร—2 GPU mesh.

Since then, Iโ€™ve extended the same framework to GPT-2 (124M).

Moving from ViTs to a language model surfaced assumptions that hadnโ€™t shown up before, especially around model loading. When starting from a pretrained GPT-2 checkpoint, each rank initially materialized the full model before sharding, causing peak-memory blowups during initialization. Fixing this meant making the loader itself shard-aware so each GPU reads only the parameter slices it owns.

Pipeline parallelism also became trickier with GPT-style decoders due to tied input embeddings and output weights, which donโ€™t fit cleanly into a single pipeline stage.

QuintNet can now fine-tune GPT-2 in a distributed setup and converge stably. Still very much a personal learning project, but Iโ€™ve been documenting the lessonsโ€”especially around efficient distributed loading.

Links if you want to dig deeper:

๐Ÿ“– Blog: https://medium.com/@shuklashashankshekhar863/why-model-loading-breaks-3d-parallelism-and-how-safetensors-fixes-it-ce572d5e6fed

๐Ÿ‘จโ€๐Ÿ’ป GitHub: https://github.com/Wodlfvllf/QuintNet

Happy to discuss or get feedback from folks working on ML systems / distributed training.

little jungle
weary jackal
little jungle
#

but ig thats the way they would get to explore more

weary jackal
primal otter
#

GUYS! DO NOT CLICK THE ABOVE LINK

night prawn
#

๐Ÿš—โšก Just dropped a ๐Ÿ”ฅ Kaggle Masterpiece: Analyzed 271K Washington State EVs with INTERACTIVE MAPS, XGBoost Ensembles, & 2027 Forecasts!

Key Insights:
โœ… Tesla dominates 60% โ€“ but Chevy Bolt crushes on range/price
โœ… Urban Heatmaps reveal Seattle hotspots (download HTML map!)
โœ… ML Beast: Rยฒ=0.95 predicting range, 94% CAFV eligibility
โœ… Forecast: +50K new EVs by 2027 โ€“ infrastructure crisis ahead?

Built with GeoPandas, Folium, StackingRegressor (XGB+LGBM+RF). Perfect for policy makers & energy pros!

๐Ÿ”— Dive in & upvote: [https://www.kaggle.com/code/hammadansari7/electric-vehicle-population-analysis]

Whatโ€™s YOUR take on EV adoption? Rural lag or tech hype?

#DataScience #Kaggle #MachineLearning #GeospatialAnalysis #ElectricVehicles #EV #Forecasting #XGBoost #Sustainability #AI

@Kaggle @Tesla @robikscube @towardsdatascience @everyone

little jungle
#

Anyone any thoughts on it?

little jungle
hallow nimbus
#

Hey everyone! Iโ€™m super excited to share that my latest paper, "Emotion estimation from video footage with LSTM," has just been accepted in Frontiers in Neurorobotics! ๐ŸŽ‰

I developed a new model called BlendFER-Lite that uses MediaPipe Blendshapes and LSTMs to detect emotions from live video. The cool part? It matches the accuracy benchmarks of much heavier models (71% on FER2013) but with significantly lower computational cost, making it perfect for real-time robotics and edge devices.

Check it out here: ๐Ÿ“„ Paper: https://www.frontiersin.org/journals/neurorobotics/articles/10.3389/fnbot.2025.1678984/full ๐Ÿค— Code & Models: https://huggingface.co/papers/2501.13432

Would love to hear your thoughts or answer any questions!

little jungle
#

how is literature review done or anyone of u do it im new to the term so anyone pls ans ๐Ÿ˜“

cinder grail
halcyon cliffBOT
#
zakas. has been warned

Reason: Bad word usage

#
zakas. has been warned

Reason: Bad word usage

#
zakas. has been banned

Reason: Too many infractions

soft flax
#

Hello hackers,

I need some help. Iโ€™m training a conversation disentanglement model using this repo: https://github.com/jkkummerfeld/irc-disentanglement
. It will be used to prepare a conversation dataset for a project.

I donโ€™t have access to compute resources that can run continuously for five days. Iโ€™m using Google Colab, but sessions eventually stop when the tab closes or times out. I also canโ€™t afford a cloud provider right now.

If anyone has a home setup that can run uninterrupted for several days and is willing to help, I would really appreciate it. Thanks!

frail yew
vale pebble
worthy nimbus
#

any IISCER buddies here?? Im working on quantum physics beamsplitter experiments under yu ting-chen, we really a few ppl to be included in this anyone there for help?!1

raw obsidian
#

Is there anyone from India studying data analysis ?? want to do some analysis projects with some new ideas

violet fable
#

Is there any dataset for Semiconductor manufacturing materials/ parts

grave finch
#

Anyone can provide the best dataset download link for deepfake detection videos with good qualities videos and of various diiferent varities ?? It will be great help to me.

tacit anchor
ocean jetty
#

๐Ÿš€ ๐—ฃ๐—ฟ๐—ผ๐˜‚๐—ฑ ๐— ๐—ผ๐—บ๐—ฒ๐—ป๐˜! ๐— ๐˜† ๐—ฅ๐—ฒ๐˜€๐—ฒ๐—ฎ๐—ฟ๐—ฐ๐—ต ๐—ฃ๐—ฎ๐—ฝ๐—ฒ๐—ฟ ๐—ถ๐˜€ ๐—ฃ๐˜‚๐—ฏ๐—น๐—ถ๐˜€๐—ต๐—ฒ๐—ฑ ๐ŸŽ‰
Iโ€™m excited to share that my research paper titled:
โ€œ๐——๐—ถ๐˜€๐˜๐—ฟ๐—ถ๐—ฏ๐˜‚๐˜๐—ฒ๐—ฑ ๐— ๐—ฎ๐—ฐ๐—ต๐—ถ๐—ป๐—ฒ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด ๐˜„๐—ถ๐˜๐—ต ๐—ฆ๐—ฝ๐—ฎ๐—ฟ๐—ธ ๐— ๐—Ÿ๐—น๐—ถ๐—ฏ: ๐—” ๐—–๐—ผ๐—บ๐—ฝ๐—ฟ๐—ฒ๐—ต๐—ฒ๐—ป๐˜€๐—ถ๐˜ƒ๐—ฒ ๐—ฃ๐—ฒ๐—ฟ๐—ณ๐—ผ๐—ฟ๐—บ๐—ฎ๐—ป๐—ฐ๐—ฒ ๐—˜๐˜ƒ๐—ฎ๐—น๐˜‚๐—ฎ๐˜๐—ถ๐—ผ๐—ปโ€
has been successfully published in a peer-reviewed journal.
๐Ÿ” ๐˜๐˜ฏ ๐˜ต๐˜ฉ๐˜ช๐˜ด ๐˜ณ๐˜ฆ๐˜ด๐˜ฆ๐˜ข๐˜ณ๐˜ค๐˜ฉ, ๐˜ ๐˜ฆ๐˜น๐˜ฑ๐˜ญ๐˜ฐ๐˜ณ๐˜ฆ๐˜ฅ:
Performance of ML models on large-scale datasets (500K โ†’ 12M records)
Comparison of classification, regression, and clustering models
Impact of distributed computing using Apache Spark MLlib
Trade-offs between accuracy, speed, and memory usage
๐Ÿ“Š Key Insight:
Distributed machine learning significantly improves scalability and efficiency, especially for complex models on large datasets.
๐Ÿ“„ Read full paper here:
๐Ÿ‘‰ https://paas-pk.org/index.php/pjosr/article/view/2013
๐Ÿ’ก This work is part of my journey in Data Science & AI, and Iโ€™m excited to keep exploring more in distributed systems and big data.
#DataScience #MachineLearning #BigData #ApacheSpark #Research #AI #WomenInTech #KiranHayatDataScientist

barren musk
barren musk
steady jungle
#

Hey guys I'm currently researching what products can be made using UV rays but I'm not a researcher technically so can I get a hand or suggestions like what should I do first?

obtuse gulch
#

๐Ÿง  Join CVPR 2026 Challenge: Foundation Models for General CT Image Diagnosis!

Develop & benchmark your 3D CT foundation model on a large-scale, clinically relevant challenge at CVPR 2026!

๐Ÿ”ฌ What's the Challenge?

Evaluate how well CT foundation models generalize across anatomical regions, including the abdomen and chest, under realistic clinical settings such as severe class imbalance.

Task 1 โ€“ Linear Probing: Test your frozen pretrained representations directly.

Task 2 โ€“ Embedding Aggregation Optimization: Design custom heads, learning schedules, and fine-tuning strategies using publicly available pretrained weights.

๐Ÿš€ Accessible to All Teams

Teams with limited compute can compete via the Task 1 - Coreset (10% data) track, and Task 2 requires no pretraining โ€” just design an optimization strategy on top of existing foundation model weights.

Official baseline results offered by state-of-the-art CT foundation model authors.

A great opportunity to build experience and strengthen your skills: Task 1 focuses on pretraining, while Task 2 centers on training deep learning models in latent feature space.

๐Ÿ“… Key Dates

  • Validation submissions: โ€“ May 10, 2026
  • Test submissions: May 10 โ€“ May 15, 2026
  • Paper deadline: June 1, 2026

Weโ€™d love to see your model on the leaderboard and welcome you to join the challenge!

๐Ÿ‘‰Join & Register: https://www.codabench.org/competitions/12650/
๐Ÿ“งContact: medseg20s@gmail.com

still hazel
#

Hello everyone, if anyone is writing or planning to write a research paper, direct message me. We can collaborate & learn together.

still hazel
reef sluice
#

The World Has a Data Problem. We Fix It.
Every AI team hits the same wall eventually.
You have the model. You have the architecture. You have the engineers. But you don't have the data, and everything stops.
Maybe your dataset is too small to train on. Maybe it carries sensitive patient records, financial transactions, or personal identifiers that legal won't let you touch. Maybe you've been waiting months for a vendor to deliver labeled data that still isn't ready. Maybe your edge cases are so rare in real life that your model keeps failing exactly where it matters most.
This is not a skill problem. This is a data problem. And it is quietly killing more AI projects than any other single reason.
We generate synthetic data.
Not as a workaround. Not as a compromise. As a legitimate, statistically rigorous alternative that lets your team move again. We produce tabular, text, image, and time-series synthetic datasets that mirror the distributions, correlations, and behavioral patterns of real-world data without exposing a single real record.
We have solved this for teams in healthcare who couldn't share patient data across departments. For fintech companies building fraud detection models with almost no real fraud examples to train on. For startups that needed 10x their dataset size before a funding deadline. For enterprises blocked by GDPR, HIPAA, and compliance teams that said no to everything.
The problem you are sitting with right now, whether it is a privacy blocker, a data scarcity issue, a class imbalance, a regulatory wall, or a timeline that real data collection simply cannot meet, has a solution. We will tell you exactly what it is within 24 hours of hearing from you.
No long sales cycles. No vague proposals. You describe your data problem in plain language, and we come back with a concrete plan.
Send us your situation: [synthox.ai@gmail.com]
The only thing worse than a data problem is spending another month pretending it will resolve itself.

still hazel
dim sluice
#

why not?

tawdry lion
#

I am thrilled to announce that I have successfully defended my Masterโ€™s research! My work introduces a novel hybrid architecture combining YOLOv11 and YOLOv12 specifically designed to detect micro-anomalies in solar panels.

Key Highlights:

Precision: The modified model excels at identifying "tiny defects" that traditional inspection methods often miss.

Interpretability: Integrated EigenCAM to provide visual explanations for model predictions, ensuring the AI's decision-making process is transparent and reliable.

Proof of Concept: Iโ€™ve developed a demo (link below/attached) that showcases the real-time detection and model representation in action.

I'm excited to see how these advancements in computer vision can contribute to the renewable energy sector!

Demo Video : https://youtu.be/cxtnjsjD_iA