echo hamletBOT May 24, 2023, 6:31 PM

#

Hey @everyone, excited to announce our journal club / presentation titled Practical Applications of ML in Biology and Drug Discovery!

In this journal club we're going to go over various models and AI algorithms that have practical applications in molecular biology and drug discovery. The presentation is going to include a number of SOTA models that can currently be used in your existing workflows, as well as some of their strengths and limitations.

Hope to see you all there.

https://discord.gg/FMMDV2fA?event=1110997694484328478

echo hamletBOT May 25, 2023, 4:56 PM

#

@here Apologies in advance for the third consecutive ping 😅. We changed the event start time to 3 PM EST instead of 2 PM EST to accommodate those who are attending the weekly DNA diffusion meetup.

Once again my apologies for any confusion.

sterile cargoBOT May 25, 2023, 6:03 PM

#

📢 Harmonai Announcement - Office Hours, Beat Synced AI Animation Workshop, and Production Challenge 🎉

Attention, @everyone! We have some exciting news and events to share with you. Please read below for important updates:

1️⃣ Harmonai Office Hours is starting now! 🕒
Come by and hear about the latest from Harmonai and ask the team questions

2️⃣ **Beat Synced AI Animation Workshop by Purz next Tuesday, May 31** 🎨🎵
We are thrilled to announce a special workshop by the talented artist, Purz! Join us for an immersive session on Beat Synced AI Animation. Purz will guide you through the creative process of combining music and animation. This workshop promises to be an incredible opportunity to expand your artistic horizons. This workshop is scheduled for 31st May.
https://twitter.com/PurzBeats

3️⃣ **Production challenge submissions are due tomorrow!** ⏰
For all the ambitious creators out there, the Harmonai Production Challenge is well underway! Just a friendly reminder that the challenge submissions are due this Friday. Showcase your skills and innovation by crafting amazing compositions with Harmonai. We can't wait to see your entries!

4️⃣ **Harmonai Challenge Showcase next Monday, May 29th** 🌟
Mark your calendars! Next Monday, May 29th, we will be hosting the Harmonai Challenge Showcase. This is your chance to witness the incredible talent within our community. Join us as we celebrate the outstanding compositions created during the production challenge. Be prepared to be inspired, amazed, and motivated by the diverse range of masterpieces made by our community!

old ventureBOT May 25, 2023, 6:38 PM

#

@everyone pickle is an egregiously unsafe file format to use to distribute ML models. A malicious pickle file can execute arbitrary code on your computer when you call unpickle, meaning that a bad actor can upload what looks like a model but is actually a Trojan giving them full access to your computer.

While this exploit is rare in the wild and there are no known cases of it happening with ML algorithms, it’s clearly not suitable to be the default format for distributing ML models.
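To make the mechanism concrete, here is a minimal and deliberately harmless sketch of how a pickled object can run code at load time. The `Payload` class is hypothetical and stands in for something uploaded as a "model"; `__reduce__` and the `pickle` calls are standard Python.

```python
import pickle


class Payload:
    """Hypothetical stand-in for an object shipped as a 'model' file."""

    def __reduce__(self):
        # pickle stores this (callable, args) pair and calls it on load.
        # A harmless expression is evaluated here; an attacker could
        # name any importable callable and arguments instead.
        return (eval, ("6 * 7",))


blob = pickle.dumps(Payload())  # bytes that could be distributed as a file
result = pickle.loads(blob)     # loading calls eval("6 * 7")
print(result)                   # the loader gets the callable's return value, not a Payload
```

Nothing in the loading code asks for anything to be executed. The call happens inside `pickle.loads`, which is why loading an untrusted pickle is the risky step.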

We’ve been working with Hugging Face and Stability AI on this, and are now ready to announce several improvements to the security of the open-source AI ecosystem:
1. SafeTensors, a library that is safe and is designed to support the needs of large-scale AI researchers.
2. Commissioning and publicly releasing an independent audit of the library by the security firm Trail of Bits.
3. Transitioning the Hugging Face Hub to use the new library, including testing and converting existing models.
4. First-class support for the new format in HF’s and EAI’s ecosystems when it comes to sharing models externally.
5. All three orgs publicly committing to using safe serialization libraries for releases going forward.
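As a rough illustration of why a data-only format avoids the problem, the sketch below writes and reads a tiny file using only a length-prefixed JSON header and raw float bytes. It is loosely modelled on the layout described in the safetensors documentation as I understand it, it is not the safetensors library, and `save_tensors` / `load_tensors` are hypothetical names.

```python
import json
import struct


def save_tensors(tensors: dict[str, list[float]]) -> bytes:
    """Serialize named float32 vectors as: u64 header length, JSON header, raw bytes."""
    header, body = {}, b""
    for name, values in tensors.items():
        raw = struct.pack(f"<{len(values)}f", *values)
        header[name] = {
            "dtype": "F32",
            "shape": [len(values)],
            "data_offsets": [len(body), len(body) + len(raw)],
        }
        body += raw
    header_bytes = json.dumps(header).encode("utf-8")
    return struct.pack("<Q", len(header_bytes)) + header_bytes + body


def load_tensors(blob: bytes) -> dict[str, list[float]]:
    """Parse the layout above. Only JSON and numbers are read; nothing is executed."""
    (header_len,) = struct.unpack_from("<Q", blob, 0)
    header = json.loads(blob[8 : 8 + header_len].decode("utf-8"))
    body = blob[8 + header_len :]
    out = {}
    for name, meta in header.items():
        start, end = meta["data_offsets"]
        # Reject anything that is not a plain in-bounds float32 buffer.
        if meta["dtype"] != "F32" or not 0 <= start <= end <= len(body):
            raise ValueError(f"bad entry for {name!r}")
        count = (end - start) // 4
        out[name] = list(struct.unpack(f"<{count}f", body[start:end]))
    return out


blob = save_tensors({"bias": [0.5, -1.0, 2.0]})
restored = load_tensors(blob)
print(restored)
```

The loader never imports or calls anything named by the file. The worst a malformed file can do in this sketch is fail validation.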

We’re going to have a waiting period before making it the default to let users iron out any bugs we missed, but we will be making it the default format across our sectors of the ecosystem, and we anticipate much of the rest of the OS LLM ecosystem following suit.

Note that it is absolutely safe to use your own or otherwise trusted pickle files. The issue arises when you download and unpickle files you find on the internet. When LLaMA was leaked, this was a serious concern I had. Fortunately, the leaked files were quickly confirmed to be identical to the officially released ones.

I want to give a huge shout-out to the AI Village, a DEF CON group that focuses on AI and security, for being the reason I personally know about this issue and for making the introduction to Trail of Bits. The AIV has a Discord server you can join here https://discord.gg/uz3hnZgnJN to discuss ML security (it’s also found in #732688974337933322).

https://blog.eleuther.ai/safetensors-security-audit/

EleutherAI Blog

🐶Safetensors audited as really safe and becoming the default

Audit shows that safetensors is safe and ready to become the default. Hugging Face, in close collaboration with EleutherAI and Stability AI, has ordered an external security audit of the safetensors library, the results of which allow all three organizations to move toward making the library the default format for saved models.
The full results o...

#

@everyone Our first work on mechanistic interpretability just came out! “Can Transformers Learn to Solve Problems Recursively?” by @dylanzhang @Curt Tigges @Stella Biderman (she/her), Maxim Raginsky, and @taliaringer trains small transformers on recursive tasks like computing the binary successor or traversing a tree.
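For readers unfamiliar with the binary successor task, here is a minimal recursive reference implementation. It only illustrates the task itself, is not taken from the paper's code, and the function name is hypothetical.

```python
def binary_successor(bits: str) -> str:
    """Return the binary string for n + 1, given the binary string for n."""
    if bits == "":
        return "1"                            # carry out of the top bit
    if bits[-1] == "0":
        return bits[:-1] + "1"                # no carry: flip the last bit
    return binary_successor(bits[:-1]) + "0"  # carry: recurse on the prefix


print(binary_successor("1011"))  # 11 + 1 = 12
```

The recursion depth equals the length of the trailing run of 1s, so inputs with long carry chains are the ones that exercise the recursive case most heavily.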

In contrast to most MI work, we find that the model fails to fully learn the tasks we study. This gives us a new way to validate our results, though: looking at our reverse-engineered algorithm, we are able to identify a type of input that the model should fail on. Lo and behold, when we looked at the accuracy breakdown we found that it failed on those inputs 100% of the time and that they comprised 91% of all model failures!

The paper has a lot more, including documenting how the LR influences generalization strategies, an Abstract State Machine formalization, and looking at how different presentations of the same problem change model behavior!

There are a lot of directions for future work, some of which we are already discussing in our new channel #1110611369574793306. If you’re interested in using LLMs to study formal reasoning, definitely stop by and say hi.

Paper: https://arxiv.org/abs/2305.14699
Twitter thread: https://twitter.com/TaliaRinger/status/1661786081249964050?s=20

arXiv.org

Can Transformers Learn to Solve Problems Recursively?

Neural networks have in recent years shown promise for helping software
engineers write programs and even formally verify them. While semantic
information plays a crucial part in these processes, it remains unclear to what
degree popular neural architectures like transformers are capable of modeling
that information. This paper examines the beha...

sterile cargoBOT May 29, 2023, 5:52 PM

#

@everyone

📢 Harmonai Challenge Showcase - Starting Now! 🌟

Join us for the Harmonai Challenge Showcase, starting now in the Center Stage!

We'll be listening to challenge submissions from @Taera, @Greg White, @crlandsc, @lyra ✨, @NeuralNotWork, @jmoso13, @Jimney, @Silvio and our very own @ODDS

Come on through! 🎵✨

ancient spearBOT May 30, 2023, 2:09 AM

#

@everyone We're excited to share our first preprint since our public launch:

🧠👁️ MindEye!

Our state-of-the-art fMRI-to-image approach that retrieves and reconstructs images from brain activity

Project page: https://medarc-ai.github.io/mindeye/
arXiv: https://arxiv.org/abs/2305.18274

Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learn...

Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors

arXiv.org

Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learn...

We present MindEye, a novel fMRI-to-image approach to retrieve and
reconstruct viewed images from brain activity. Our model comprises two parallel
submodules that are specialized for retrieval (using contrastive learning) and
reconstruction (using a diffusion prior). MindEye can map fMRI brain activity
to any high dimensional multimodal latent s...

sterile cargoBOT Jun 1, 2023, 6:00 PM

#

Hey @everyone, Harmonai Office Hours is starting now!

Come join the Harmonai team in the Center Stage to hear about the latest AI Music developments.

https://discord.gg/Qpj4nzEd?event=1113832020653641768

sterile cargoBOT Jun 6, 2023, 5:59 PM

#

@everyone Starting now in the Center Stage!
https://discord.gg/Sn9gmgyv?event=1113884419003007037

hazy otterBOT Jun 21, 2023, 7:51 AM

#

@everyone
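[New project, part 1] We're running "Stable Diffusion Life: Call for Your Use Cases"!
Tell us how you use #StableDiffusion and share your stories, whether at work, in your hobbies, or in unexpected places!
Three selected entrants will each receive $20 of DreamStudio credits 🎁 and be featured on our official note and Twitter accounts! ▶[https://note.com/stabilityai/n/n357cd4bbc960]
We haven't received many entries yet, so please join in!
By the way, $20 of credits lets you run Stable Diffusion 5,000 times!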

hazy nacelleBOT Jun 23, 2023, 6:12 PM

#

https://twitter.com/jsuarez5341/status/1672306455092027392?s=20 @Neural-MMO 🦾

Joseph Suarez (@jsuarez5341)

Announcing the Neural MMO 2.0 Competition on Multi-Task Reinforcement Learning and Curriculum Generation at NeurIPS 2023! Partnered with @StabilityAI @carperai @ParametrixAI @aicrowdHQ! Details in the coming weeks … 1/4🧵

hazy nacelleBOT Aug 8, 2023, 6:54 PM

#

@model release Some exciting news from our awesome @code.ai team!! We are finally live with what we have been working on for the past few months: StableCode, our first step towards helping the world get to 1 billion developers! https://twitter.com/StabilityAI/status/1688931312122675200

Stability AI (@StabilityAI)

🚀Exciting news! Stability AI has launched StableCode, the revolutionary generative AI LLM for coding!

💡 Developers, get ready to level up your coding game! #AI #Coding #StableCode #StabilityAI

https://t.co/XFrV36JMMu

Likes: 386 · Retweets: 121

hazy otterBOT Aug 10, 2023, 2:22 AM

#

@everyone
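[New release] Stability AI Japan has released the highest-performing Japanese language models, "Japanese StableLM Base Alpha 7B" and "Japanese StableLM Instruct Alpha 7B"!
https://ja.stability.ai/blog/japanese-stablelm-alpha
To mark the release, we're holding a talk event with the LLM developers on our official Discord tonight from 18:00. The content will be technical, but newcomers are very welcome! There will also be a Q&A session, so please join us 🎉 👉
https://discord.com/invite/KGTF3m2U?event=1139018615358767145

Stability AI Japan

We have released the Japanese language model "Japanese StableLM Alpha" - Stability AI Japan

Stability AI Japan has publicly released "Japanese StableLM Base Alpha 7B", a 7-billion-parameter general-purpose language model for Japanese, and "Japanese StableLM Instruct Alpha 7B", an instruction-following language model. In a performance evaluation on multiple Japanese tasks using the benchmark suite "lm-evaluation-harness", these models achieve the best performance among publicly available Japanese-language models.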

hazy nacelleBOT Aug 19, 2023, 1:03 AM

#

Big congrats to @Shahbuland @Tanishq on the initial release of DRLX! Featuring an accelerated DDPO trainer and various reward functions. Stay tuned for RLHF-enhanced image gen!
https://twitter.com/carperai/status/1692696352583557190?s=20

Carper, a Stability AI lab (@carperai)

At Carper, we made TRLX to align LLMs with human feedback and now we're gonna do the same for diffusion models with DRLX! The initial release features an accelerated DDPO trainer and various reward functions. Stay tuned for RLHF-enhanced image gen!
https://t.co/5aIPuicuLR

sterile cargoBOT Sep 14, 2023, 5:26 PM

#

@everyone

📢 We're super excited to share the release of Stability AI's new text-to-audio platform: Stable Audio! 📢

🎵 Try it out now at https://www.stableaudio.com and share what you make 🎵

🖥 Training and inference code, as well as open-source models will be coming soon! 🖥️

If you want to join the discussion, head on over to the #stable-audio channel in the Stable Foundation server: https://discord.gg/k3vkc5vE

We'll be hosting our Harmonai Office Hours in a little over half an hour to discuss Stable Audio, and our upcoming open-source releases. Come on through! https://discord.gg/92kynGB4?event=1151585333012611104

Stable Audio - Generative AI for music & sound fx

Make original music and sound effects using artificial intelligence, whether you’re a beginner or a pro.

sterile cargoBOT Oct 12, 2023, 6:07 PM

#

Hey @everyone, Harmonai Office hours are starting! https://discord.gg/ETrJTYNk?event=1162087709309931643

Also, we've made our training repo for our audio generation models public!

Check out https://github.com/Stability-AI/stable-audio-tools to see what we've been working on. There are no pre-trained models there yet; those will be coming soon.

GitHub

GitHub - Stability-AI/stable-audio-tools: Generative models for con...

Generative models for conditional audio generation - GitHub - Stability-AI/stable-audio-tools: Generative models for conditional audio generation

old ventureBOT Oct 17, 2023, 4:40 PM

#

@everyone
Continuing EleutherAI’s mission of pushing forward open research and broadening access to the tools that make this possible, we are releasing Llemma, a family of powerful base models for mathematics trained via continued pretraining of CodeLlama on a general mathematics dataset for up to 200B tokens.

Why is this important?
A year ago, Google published Minerva, an LLM with impressive mathematical reasoning abilities. Minerva isn’t publicly accessible, preventing research from building on these advances. This has hindered outside progress in the Math+AI subfield greatly!

Just as CodeLlama has helped spur advances in open AI-for-code research, we hope that Llemma will be a strong platform that others build on to further the study of AI for mathematics! We release our models, datasets, and training, evaluation, and analysis code.

This is the beginning of our research on this topic, not the end. Our current plans include furthering few-shot theorem proving and finetuning for full-proof generation, and much more. Come join us in https://discord.com/channels/729741769192767510/1112407059359600662 to get involved. We also meet on Thursdays at 2pm US Eastern Time.

Work done through collaboration between EleutherAI and several academic labs, by @zhangir_azerbayev @Hailey Schoelkopf @keirp @dsantosmarco @mcaleste @Albert Jiang, Jia Deng, @Stella Biderman (she/her), and @wellecks!

Blog post: https://blog.eleuther.ai/llemma
ArXiv paper: https://arxiv.org/abs/2310.10631
Project page: https://github.com/EleutherAI/math-lm
Sample explorer: https://llemma-demo.github.io/

old ventureBOT Oct 24, 2023, 9:53 PM

#

Hi all! @NLP

We just released a new paper on Quality-Diversity through AI Feedback (QDAIF), a way for LLMs to automatically generate meaningfully diverse, high-quality text responses in creative domains (like generating stories and poems).

For many tasks we want a diverse range of high-quality outputs from models to choose from. QD algorithms aim towards this, but it's challenging to define measures for quality and diversity by hand in subjective domains like creative writing. Inspired by RLAIF, what if LLMs assessed qualitative features of diversity, too? That way LLMs could generate, diversify, and improve their own responses.

QDAIF enables this search for diverse, high-quality solutions, overcoming the limitations of hand-crafted measures in creative writing domains (opinions, stories, poetry). We found QDAIF to be better suited in creative writing domains at covering the search space with diverse, high-quality stories, poems, etc., compared to baselines and verified the grounding of QDAIF through human evaluation.
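To give a feel for what a quality-diversity search does, here is a toy MAP-Elites-style loop. It is a generic sketch and may not match the paper's exact procedure. The `mutate`, `quality`, and `diversity_bin` functions are hypothetical toy stand-ins for the places where QDAIF would query an LLM for rewriting and feedback.

```python
import random

WORDS = ["sun", "rain", "joy", "grief", "sea", "stone"]
HAPPY = {"sun", "joy", "sea"}


def mutate(text: str, rng: random.Random) -> str:
    """Toy stand-in for 'ask an LLM to rewrite this text'."""
    words = text.split()
    words[rng.randrange(len(words))] = rng.choice(WORDS)
    return " ".join(words)


def quality(text: str) -> float:
    """Toy stand-in for an LLM quality score: reward varied vocabulary."""
    words = text.split()
    return len(set(words)) / len(words)


def diversity_bin(text: str, n_bins: int = 3) -> int:
    """Toy stand-in for an LLM diversity label: bin by share of 'happy' words."""
    words = text.split()
    share = sum(w in HAPPY for w in words) / len(words)
    return min(int(share * n_bins), n_bins - 1)


def qd_search(seed: str, steps: int = 500, rng_seed: int = 0) -> dict[int, tuple[float, str]]:
    rng = random.Random(rng_seed)
    archive = {diversity_bin(seed): (quality(seed), seed)}  # bin -> best so far
    for _ in range(steps):
        _, parent = rng.choice(list(archive.values()))
        child = mutate(parent, rng)
        b, q = diversity_bin(child), quality(child)
        if b not in archive or q > archive[b][0]:
            archive[b] = (q, child)  # keep only the best text per bin
    return archive


archive = qd_search("rain rain rain rain")
for b, (q, text) in sorted(archive.items()):
    print(b, round(q, 2), text)
```

The key design point is the archive: instead of keeping one best answer, the search keeps the best answer per diversity bin, so improving quality never collapses the variety of outputs.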

This work was part of a research collaboration between EleutherAI, CarperAI, StabilityAI, Aleph Alpha, UBC, and others. Shoutout to @andmany, @joellehman, and @lactoseintol (and any others I've missed), not to mention @gooseluvr for the support!

Project page: https://qdaif.github.io/
ArXiv: https://arxiv.org/abs/2310.13032
Tweet: https://x.com/andrewdai99/status/1716913881816383805?s=20

Quality-Diversity through AI Feedback

hazy nacelleBOT Oct 24, 2023, 9:53 PM

#

Hi all!

We just released a new paper on Quality-Diversity through AI Feedback (QDAIF), a way for LLMs to automatically generate meaningfully diverse, high-quality text responses in creative domains (like generating stories and poems).

For many tasks we want a diverse range of high-quality outputs from models to choose from. QD algorithms aim towards this, but it's challenging to define measures for quality and diversity by hand in subjective domains like creative writing. Inspired by RLAIF, what if LLMs assessed qualitative features of diversity, too? That way LLMs could generate, diversify, and improve their own responses.

QDAIF enables this search for diverse, high-quality solutions, overcoming the limitations of hand-crafted measures in creative writing domains (opinions, stories, poetry). We found QDAIF to be better suited in creative writing domains at covering the search space with diverse, high-quality stories, poems, etc., compared to baselines and verified the grounding of QDAIF through human evaluation.

This work was part of a research collaboration between EleutherAI, CarperAI, StabilityAI, Aleph Alpha, UBC, and others. Shoutout to @andmany, @joellehman, and @lactoseintol (and any others I've missed), not to mention @canadagoose1 for the support!

Project page: https://qdaif.github.io/
ArXiv: https://arxiv.org/abs/2310.13032
Tweet: https://x.com/andrewdai99/status/1716913881816383805?s=20

old ventureBOT Dec 6, 2023, 6:42 PM

#

@everyone We've had a number of papers accepted for publication recently, including both previously released ones and new ones. In lieu of many posts today, I figured batching them probably makes more sense 🙂 I've provided some annotations to help guide which might be new to you.

Apologies if I'm missing your tag or paper! Please DM me with corrections.

EMNLP
RWKV: Reinventing RNNs for the Transformer Era (Findings) by @blinkdl @hypnopump @Quentin Anthony, et al.

trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback by @alexhavrilla @maxreciprocate @duyphung.ai @1.69 @Ryan Gosling @Stella Biderman (she/her) @Quentin Anthony, Ethan Kim, and @ihateihatelouis (paper forthcoming)

NeurIPS
The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs (spotlight) by @Laura Ruis, Akbir Khan, @Stella Biderman (she/her), @sara_hooker, Tim Rocktäschel, and Edward Grefenstette (major paper update)

LEACE: Perfect linear concept erasure in closed form by @norabelrose @dsj @shauli_ @rcotterell @edwardraff @Stella Biderman (she/her)

Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors by Scotti et al. incl. @ilovescience (spotlight, previously unannounced)

Emergent and Predictable Memorization in Large Language Models by @Stella Biderman (she/her) @Orz @lintangsutawika @Hailey Schoelkopf @Quentin Anthony @Ryan Gosling and @edwardraff

Math-AI Workshop
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text by @keirp @dsantosmarco @zhangir_azerbayev and Jimmy Ba

Llemma: An Open Language Model For Mathematics by @zhangir_azerbayev @Hailey Schoelkopf @keirp @dsantosmarco @mcaleste @Albert Jiang @jianga2718 @Stella Biderman (she/her) @wellecks

Socially Responsible Language Modelling Research Workshop
Eliciting Language Model Behaviors using Reverse Language Models (spotlight) by Pfau et al. incl. @alexinfanger and @ai_waifu (new paper)

@Stella Biderman (she/her) has an invited panel.

Workshop on Backdoors in Deep Learning
Detecting Backdoors with Meta-Models by Langosco et al. incl. @Hyperion (new paper)

Workshop on Attributing Model Behavior at Scale (ATTRIB)
Sparse Autoencoders Find Highly Interpretable Features in Language Models by @hoagy @aidan ewart @loganriggs @Robert_AIZI @leesharkey

Apache Cassandra Conference
@picocreator has an invited talk on RWKV

Nature
Roleplay with Large Language Models by Murray Shanahan and @repligate (previously unannounced)

Preprints
The OpenELM Library by @Hyperion @Honglu Ryan Zhou, Daniel Scott, and @joellehman (new paper)

Meet-ups
If you want to meet up with EleutherAI check out #1171809525280550932 #1171291697561477170 #1182032181921587200 respectively.

knotty pagodaBOT Dec 25, 2023, 7:21 AM

#

Happy holidays to you all! 🎄

echo hamletBOT Jan 1, 2024, 7:32 PM

#

@everyone Wishing you a Joyous New Year! As we step into this exciting chapter, we extend our heartfelt gratitude to each of you who joined our community. Whether you enriched our open research initiatives or engaged in spirited discussions across our channels, your contributions have been invaluable. A sincere thank you to everyone, and we look forward to sharing more meaningful moments with you in the upcoming year! Cheers to a fantastic year ahead! 🎉

https://tenor.com/view/funny-animals-kittens-smiling-cute-cats-happy-gif-12099363

Tenor

ancient spearBOT Jan 25, 2024, 1:14 PM

#

@everyone We're excited to share some of our first work from our LLM efforts!

Our goal is to eventually train open LLMs that have SOTA medical capabilities, but first we must understand how current LLMs perform.

That's what we do in our new blog post!

Highlights:

• We've implemented Google's MultiMedQA suite of tasks in EleutherAI's lm-eval-harness for easy eval of open LLMs

• We've discovered that SOTA generalist open LLMs like Qwen-72b outperform Med-PaLM (SOTA in Nov. 2022) and even openly released medical LLMs like Meditron-70b, all without any special prompting

• We perform a dataset contamination analysis and don't observe any strong signs of test set contamination

• Lots of future directions to explore for medical LM evals; this blog post is just part 1!

Read here → https://www.medarc.ai/blog/medarc-llms-eval-part-1

MedARC tweet → https://twitter.com/MedARC_AI/status/1750506121545359862

MedARC

Evaluating the Medical Knowledge of Open LLMs - Part 1 — MedARC

Explore how Large Language Models (LLMs) like GPT-4 are transforming healthcare. MedARC delves into their use in clinical decision support and administrative tasks. Our focus includes comparing generalist and medical domain-specific LLMs, and evaluating their performance on MultiMedQA tasks for medi

MedARC (@MedARC_AI) on X

Sharing our first work from our LLM efforts!

We've evaluated the medical knowledge of open LLMs (Mistral, Llama-2, etc.) & compared them to closed LLMs like GPT-4 which are SOTA.

Open LLMs perform surprisingly well, read our blog post to learn more! ↓
https://t.co/V57ij14uSV

old ventureBOT Feb 9, 2024, 4:28 AM

#

@everyone Do neural nets learn features in a predictable order?

Our results suggest the answer is “yes”— networks learn statistics of increasing complexity. Early-training networks only use low-order moments (mean & covariance) of the input distribution.

Specifically, we show that networks automatically learn to perform well on maximum-entropy distributions whose low-order statistics match those of the training set early in training, then lose this ability later.

We also extend our theory to language models by proving an equivalence between token n-gram frequencies and the moments of embedding vectors. Empirically, we find a fascinating double descent phenomenon: Pythia does well on unigram & bigram sequences in the first ~256 steps, then gets worse as it learns higher-order n-gram statistics, then gets better again by using in-context learning to adjust to the new distribution.
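For intuition about what "unigram" and "bigram" sequences are, the toy sketch below fits token counts on a tiny corpus and samples sequences that match only those low-order statistics. The paper's construction is more general. The corpus and function names here are made up for illustration.

```python
import random
from collections import Counter, defaultdict


def fit_ngrams(tokens: list[str]):
    """Count unigram frequencies and bigram (previous -> next) frequencies."""
    unigrams = Counter(tokens)
    bigrams = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        bigrams[prev][nxt] += 1
    return unigrams, bigrams


def sample_unigram(unigrams, length, rng):
    """Draw tokens independently, matching only per-token frequencies."""
    vocab, weights = zip(*unigrams.items())
    return rng.choices(vocab, weights=weights, k=length)


def sample_bigram(unigrams, bigrams, length, rng):
    """Draw a Markov chain, matching adjacent-pair frequencies as well."""
    out = sample_unigram(unigrams, 1, rng)
    while len(out) < length:
        followers = bigrams.get(out[-1])
        if not followers:  # token never seen with a successor: restart
            out += sample_unigram(unigrams, 1, rng)
            continue
        vocab, weights = zip(*followers.items())
        out += rng.choices(vocab, weights=weights, k=1)
    return out


rng = random.Random(0)
corpus = "the cat sat on the mat the cat ran".split()
unigrams, bigrams = fit_ngrams(corpus)
uni_sample = sample_unigram(unigrams, 8, rng)
bi_sample = sample_bigram(unigrams, bigrams, 8, rng)
print(" ".join(uni_sample))
print(" ".join(bi_sample))
```

A model that has only learned unigram statistics should find the first kind of sequence unsurprising. One that has learned bigram statistics should also do well on the second.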

Finally, we use optimal transport methods to surgically edit the low-order statistics of one class to match those of another, and show that early-training networks treat the edited samples as if they were drawn from the target class.

Paper: https://arxiv.org/abs/2402.04362
Code: https://github.com/EleutherAI/features-across-time
Twitter thread: https://x.com/norabelrose/status/1755680678736547910?s=20

Thanks to @quintinpope @luciaquirke @Alex Mallen @.xfern

old ventureBOT Mar 7, 2024, 4:28 PM

#

@everyone Most large language models trained last year were multilingual, but our tooling for evaluating models trained in languages other than English and Chinese is quite limited. Oftentimes, organizations will even evaluate their models by translating evaluation benchmarks. However, the kinds of questions of interest to people in different countries and cultures differ, and sometimes the correct answer to an allegedly objective question differs by language!

To improve evaluation practices for the Korean language, we've been working with Korean NLP researchers and industry practitioners to build two new evaluation datasets:

Hae-Rae Bench: This evaluation benchmark contains six tasks across four domains: vocabulary, history, general knowledge, and reading comprehension. HAE-RAE Bench emphasizes a model's aptitude for recalling Korean-specific knowledge and cultural contexts, presenting a greater challenge to non-native models by disturbing abilities and knowledge learned from crosslingual transfer. This work was done by @GSON @albert_h_lee.

KMMLU: This benchmark replicates the methodology that produced MMLU, but using examinations common in Korea. We manually annotate a subset of the questions as to whether they require Korea-specific knowledge and also designate a KMMLU-Hard subset that current models find especially challenging. This work was done by @GSON @albert_h_lee @lliy8786 @muennighoff @Stella Biderman (she/her).

Hae-Rae Bench has been accepted to LREC-COLING 2024, and KMMLU is under review at ACL. Both of them can be run today via the Language Model Evaluation Harness.

If you speak a language other than English or belong to a non-mainstream culture in an English-speaking country and would like to talk about designing a benchmark to measure language model competencies that matter to you, come join us in the new #1208111628051152969 channel. If you're interested in evaluation more generally or want help using our evaluation framework, come by #755950983669874798.

hazy otterBOT Mar 21, 2024, 3:32 AM

#

@everyone
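Hello everyone, and thank you as always. Stability AI Japan has two pieces of news to share.

Pre-orders now open for the "Image Generation AI Stable Diffusion Start Guide" (画像生成AI Stable Diffusion スタートガイド)

AICU Inc., an official Stability AI partner company, is publishing the "Image Generation AI Stable Diffusion Start Guide".

Members of Stability AI Japan, including our representative Jerry Chi, officially took part in reviewing it.
From engineers to AI illustrators, readers can learn Stable Diffusion from the basics through to advanced use.
It also covers extensions and generation techniques, so even beginners can use the book to generate the illustrations they have in mind.

It is scheduled to go on sale on Friday, March 29. Please pre-order via the link below.
https://j.aicu.ai/SBXL

Stability AI Japan × NVIDIA GTC24 commemorative campaign now running

NVIDIA's "GPU Technology Conference 2024 (GTC2024)" is being held from Tuesday, March 19 to Friday, March 22.

To celebrate, we are running a campaign in which you can win a GeForce RTX 4090 signed by NVIDIA CEO Jensen Huang.
You can enter the draw simply by watching any session of your choice at the GTC2024 AI conference.

How to enter

Follow and repost the official X account @StabilityAI_JP to enter
https://twitter.com/StabilityAI_JP/status/1767737620736659863
Register for #GTC24 for free via the link below and watch at least one session
* Only those who set Location to Japan when registering are eligible.
https://nvidia.com/ja-jp/gtc/?ncid=ref-inpa-805225

Entries close at 4:59 PM on Tuesday, March 27.
This is a rare chance to win a GPU signed by the NVIDIA CEO. Don't miss out, apply now.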

hazy otterBOT Mar 26, 2024, 3:00 AM

#

@everyone

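Stable Code Instruct 3B released!

We have released "Stable Code Instruct 3B", a new instruction-tuned LLM based on Stable Code 3B.
With this model, natural-language prompts can be used to handle a variety of tasks, including code generation, mathematics, and other software-engineering-related outputs.

Its performance rivals that of models of equal or larger size, such as Codellama 7B Instruct and DeepSeek-Coder Instruct 1.3B.
It can also be used commercially by obtaining a Stability AI Membership.

See the article below for details.
https://ja.stability.ai/blog/stable-code-instruct-3b

Stability AI Japan

Introducing Stable Code Instruct 3B - Stability AI Japan - Stability AI Ja...

Stable Code Instruct 3B is the latest instruction-tuned large language model, built on top of Stable Code 3B. It aims to improve the efficiency and intuitiveness of tasks related to programming and software development by enhancing code completion and supporting natural-language interaction. According to our analysis, Stable Code Instruct 3B outperforms comparable models such as Codellama 7B Instruct and DeepSeek-Coder Instruct 1.3B on a variety of coding-related tasks.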

hazy otterBOT Apr 4, 2024, 12:30 AM

#

@everyone

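Stable Audio 2.0 demo site released!

We have launched the demo site for "Stable Audio 2.0", a new model built on Stable Audio 1.0.

This model can generate high-quality full tracks of up to three minutes in 44.1 kHz stereo.
It also has audio-to-audio capabilities.
You can generate a wide range of sounds using audio samples and natural-language prompts.

Sound effect generation and style transfer have also been expanded,
giving artists and musicians more flexibility and control and enhancing the creative process.

The model is free to use on the demo site. Give it a try and start creating.
https://stableaudio.com/

See this article for details.
https://ja.stability.ai/blog/stable-audio-20

Stability AI Japan

Introducing Stable Audio 2.0 - Stability AI Japan

Introducing Stable Audio 2.0. This model enables high-quality full tracks with coherent musical structure, up to three minutes long in 44.1 kHz stereo, from a single natural-language prompt. The new model goes beyond text-to-audio and also offers audio-to-audio capabilities. Users can upload audio samples and transform them into a wide range of sounds through natural-language prompts. This update also expands sound effect generation and style transfer, giving artists and musicians flexibility and control, and the creative process...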

sterile cargoBOT Apr 4, 2024, 9:02 PM

#

@everyone

📣 🎵 We're thrilled to announce the launch of Stable Audio 2.0! 🎵 📣

This new model enables higher quality outputs up to three minutes with improved prompt fidelity, and is now available to all users at https://www.stableaudio.com.

We've also enabled the ability to upload your own audio for style transfer!

Check out our full release blog post here, with a few details on the model implementation: https://stability.ai/news/stable-audio-2-0

We're also super excited to bring back our 24/7 Stable Radio with music made entirely by Stable Audio! Check it out here: https://www.youtube.com/watch?v=yvOXZ6SV2Rk

Stability AI

Introducing Stable Audio 2.0 — Stability AI

Today, we are pleased to introduce Stable Audio 2.0. This model enables high-quality, full tracks with coherent musical structure up to three minutes long at 44.1 kHz stereo from a single natural language prompt.

YouTube

Stable Audio

Stable Radio 24/7

Stable Radio, a 24/7 live stream that features tracks exclusively generated by Stable Audio.
Explore the model and start creating for free on stableaudio.com

▶ Play video

hazy otterBOT Apr 18, 2024, 3:01 AM

#

@everyone

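[Follow-up] Stable Diffusion 3 API released! Plus more detailed information

We'd like to share more detailed information about the Stable Diffusion 3 API that we released yesterday.
Please have a read of the Japanese blog post.

As shown in the Stable Diffusion 3 research paper, this model outperforms text-to-image systems such as DALL-E 3 and Midjourney v6 in typography and prompt adherence, based on human preference evaluations.

The new Multimodal Diffusion Transformer (MMDiT) architecture uses separate sets of weights for image and language representations, which improves text understanding and spelling capabilities compared with previous versions of Stable Diffusion.

The model is available via the API starting today, but we are continuing to improve it ahead of an open release. In keeping with our commitment to open generative AI, we aim to make the model weights available with a Stability AI Membership in the near future.

For more information, see the Japanese blog post 🗾 below!
https://ja.stability.ai/blog/stable-diffusion-3-api

We have also created a Colab where you can try the Stable Diffusion 3 API right away for free. Please make use of it.
https://x.com/xqdior/status/1780618334607942054

Stability AI Japan

Introducing the Stable Diffusion 3 API - Stability AI Japan

Stable Diffusion 3 and Stable Diffusion 3 Turbo are now available on the Stability AI Developer Platform API.

D̷ELL (@xqdior) on X

Today, Stable Diffusion 3 and Stable Diffusion 3 Turbo became available on the #StabilityAI Developer Platform API.

I made a Colab Notebook for trying #StableDiffusion3 right away.
Feel free to use it.
https://t.co/EU3n9D7Czd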

hazy otterBOT May 9, 2024, 5:06 AM

#

@everyone

Japanese Stable LM 2 1.6B released!

We have released the Japanese large language model "Japanese Stable LM 2 1.6B".

Japanese Stable LM 2 1.6B (JSLM2 1.6B) is a small Japanese language model trained with 1.6 billion parameters.
Keeping the model size to a small 1.6 billion parameters reduces the hardware required, allowing more developers to take part in the generative AI ecosystem.
We provide Japanese Stable LM 2 Base 1.6B as the base model, and Japanese Stable LM 2 Instruct 1.6B, which has been instruction-tuned.

This model can be used commercially by joining the Stability AI Membership.

See the blog below for details!
https://ja.stability.ai/blog/japanese-stable-lm-2-16b

Stability AI Japan

We have released the Japanese large language model "Japanese Stable LM 2 1.6B" - Stability AI Japa...

We have released the Japanese large language model "Japanese Stable LM 2 1.6B". Japanese Stable LM 2 1.6B (JSLM2 1.6B) is a small Japanese language model trained with 1.6 billion parameters. We provide Japanese Stable LM 2 Base 1.6B as the base model, and Japanese Stable LM 2 Instruct 1.6B, which has been instruction-tuned. Both models can be used commercially with a Stability AI Membership.

hazy otterBOT May 10, 2024, 1:49 AM

#

@everyone

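Discord app "Stable Artisan" released!

We have released Stable Artisan, which lets you use Stability AI models directly on Discord.

The capabilities of Stability AI's Developer Platform API become available to a wider range of users.
Powered by advanced models such as Stable Diffusion 3, Stable Video, and Stable Image Core, Stable Artisan lets you create high-quality media directly within Discord.
Tools for editing your work are included, such as Search and Replace, Remove Background, Creative Upscale, and Outpainting.

Discord server: https://discord.gg/stablediffusion

See the blog post below for details! (English)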
https://ja.stability.ai/blog/stable-artisan

Stability AI Japan

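Stable Artisan: Media Generation and Editing on Discord - Stability AI Japan

One of the most frequent requests from the Stable Diffusion community is the ability to use our models directly on Discord. Today, we are introducing Stable Artisan, an easy-to-use tool for generating media on Discord.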

echo hamletBOT May 13, 2024, 12:36 AM

#

Hey @everyone, some extremely cool people prepared an AlphaFold3 letter to the Nature editor criticizing the journal for not upholding their policies about making code available to reviewers and alongside publications and also expressing concern about the precedent this sets. The letter and a form to collect endorsements are at https://docs.google.com/forms/d/e/1FAIpQLSf6ioZPbxiDZy5h4qxo-bHa0XOTOxEYHObht0SX8EgwfPHY_g/viewform

Google Docs

Letter to the Editor: AlphaFold3

We are submitting the follow as a Letter to the Editor and will post the text immediately on Zenodo. If you would like to endorse to this letter, please fill out the form below.
Authors:
Stephanie A. Wankowicz, UCSF
Pedro Beltrao, ETH
Benjamin Cravatt, Scripps
Roland Dunbrack, FCCC
Anthony Gitter, UW Madison
Kresten Lindorff-Larsen, Copenhagen
...

sterile cargoBOT Jun 5, 2024, 5:01 PM

#

@everyone

📢 Stable Audio Open 1.0 is available to the public! 📢

Model weights are available at https://huggingface.co/stabilityai/stable-audio-open-1.0

This model is trained to generate sound effects, samples, and field recordings. Great for making samples for your music!

You can use the stable-audio-tools repo to fine-tune this model on your own sample libraries and create your own custom Stable Audio models.

We're so excited to get this out, can't wait to see what y'all make with it!

stabilityai/stable-audio-open-1.0 · Hugging Face

hazy otterBOT Jun 12, 2024, 1:16 PM

#

@everyone

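Stable Diffusion 3 Medium released!

Today, we are pleased to announce the open weights of Stable Diffusion 3 Medium, the latest and most advanced text-to-image AI model in the Stable Diffusion 3 series! 🎊
This new release is an important milestone in the evolution of generative AI and continues our commitment to democratizing this powerful technology.

SD3 Medium is a 2-billion-parameter SD3 model with several notable features.

Photorealism: Overcomes the artifacts commonly seen in hands and faces, delivering high-quality images without the need for complex workflows.
Prompt adherence: Understands complex prompts involving spatial relationships, compositional elements, actions, and styles.
Text generation: Thanks to the Diffusion Transformer architecture, achieves unprecedented results in generating text without artifacts or spelling errors.
Resource efficiency: Its low VRAM footprint lets it run on standard consumer GPUs without performance degradation.
Fine-tuning: Can absorb nuanced details from small datasets, making it ideal for customization.

Read more here 🎉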
https://ja.stability.ai/blog/stable-diffusion-3-medium

Stability AI Japan

Announcing the open release of Stable Diffusion 3 Medium, our most sophisticated image generation model - Stability A...

Announcing Stable Diffusion 3 Medium, the latest and most advanced text-to-image AI model in the Stable Diffusion 3 series.

hazy otterBOT Jul 24, 2024, 11:58 PM

#

@everyone

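Stable Video 4D released!

Stable Video 4D is Stability AI's first video-to-video generation model. Users upload a single video and receive dynamic novel-view videos from eight new angles,
opening up a new level of versatility and creativity.

Transforms a single object video into multiple novel-view videos from eight different angles/views.
Generates 5 frames across 8 views in about 40 seconds with a single inference.
Users can specify camera angles, tailoring the output to specific creative needs.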
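As a rough back-of-the-envelope reading of the throughput figure above, the sketch below assumes "5 frames across 8 views" means 5 frames for each of the 8 views. That interpretation, and the variable names, are assumptions for illustration and do not come from the announcement.

```python
# Rough throughput estimate from the announced figures.
# Assumption: "5 frames across 8 views" means 5 frames per view.
views = 8
frames_per_view = 5
approx_seconds = 40  # "about 40 seconds" per inference, per the announcement

total_frames = views * frames_per_view
seconds_per_frame = approx_seconds / total_frames

print(f"{total_frames} frames per inference, ~{seconds_per_frame:.1f} s per frame")
```

Under that reading, one inference yields 40 frames at roughly one second per frame.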

See the blog post below for details!
https://ja.stability.ai/blog/stable-video-4d

Stability AI Japan

Stable Video 4D: The latest AI model for dynamic multi-angle video generation - Stability AI Japa...

Introducing Stable Video 4D, an innovative model that takes a single uploaded video and returns dynamic novel-view videos from eight new angles/views.

old ventureBOT Oct 11, 2024, 5:39 PM

#

Hey @everyone! The HPC team led by @Quentin Anthony has been hard at work keeping our GPT-NeoX library at the forefront of large-scale AI training. The most recent major feature, with @dmayhem93 (Super Saiyan Aligned) @Not Not Louis e/🐘 and @nathanthinks at SynthLabs and @ai_waifu, is the introduction of post-training to GPT-NeoX. Now you can do SFT, DPO, and KTO finetuning native to the GPT-NeoX library itself, and we have other algorithms including REINFORCE and PPO on the way.

Our testing shows a 30% performance improvement over HuggingFace's trl library at the 13B scale, with the added bonus of being scalable to massive computing systems that trl doesn't support.

This is part of a broader push to improve the GPT-NeoX library and continue to power open research at scale on frontier HPC systems. Preference learning joins other new features such as:

AMD GPUs
Mixture-of-Experts (MoE) layers
RWKV and Mamba
Sequence parallelism
as part of our forthcoming 3.0 release. All of these features can be tested today on main if you don't want to wait for the stable release, though! GPT-NeoX 3.0 is currently in pre-release bug testing, so if you give it a try, stop by #730090096287547444 and let us know what your experience is like.

Check out our blog post (and SynthLabs' here) to learn more or head over to the GPT-NeoX library to give it a try.

EleutherAI Blog

RLHF and RLAIF in GPT-NeoX

GPT-NeoX now supports post-training thanks to a collaboration with SynthLabs.

hazy otterBOT Oct 22, 2024, 2:35 PM

#

@everyone

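Stable Diffusion 3.5 Large & Large Turbo released!

Highlights

Customizability: Easily fine-tune the model to meet specific creative needs, or build applications based on customized workflows.
Efficient performance: Optimized to run on standard consumer hardware without heavy demands, especially the Stable Diffusion 3.5 Medium and Stable Diffusion 3.5 Large Turbo models.
Diverse outputs: Creates images representative of the world, not just one type of person, with a variety of skin tones and features, without the need for extensive prompting.

Released models

Stable Diffusion 3.5 Large: With 8 billion parameters, superior quality, and strong prompt adherence, this base model is the most powerful in the Stable Diffusion family. It is ideal for professional use cases at 1 megapixel resolution.
Stable Diffusion 3.5 Large Turbo: A distilled version of Stable Diffusion 3.5 Large that generates high-quality images with exceptional prompt adherence in just 4 steps. It is considerably faster than Stable Diffusion 3.5 Large.
Stable Diffusion 3.5 Medium (to be released October 29): With 2.6 billion parameters and an improved MMDiT-X architecture and training methods, this model is designed to run "out of the box" on consumer hardware, balancing ease of customization with image quality. It can generate images at resolutions between 0.25 and 2 megapixels.

Stable Diffusion 3.5 Large and Stable Diffusion 3.5 Large Turbo are available for download now from Hugging Face, and the inference code is available on GitHub.
See the blog post below for details!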
https://ja.stability.ai/blog/introducing-stable-diffusion-3-5

Stability AI Japan

Introducing Stable Diffusion 3.5 - Stability AI Japan

Introducing Stable Diffusion 3.5. This open release includes multiple model variants, including Stable Diffusion 3.5 Large and Stable Diffusion 3.5 Large Turbo.

echo hamletBOT Jan 15, 2025, 5:39 PM

#

@everyone Wanted to give a massive shout-out to @de_muedi, @utterly_butterly, and the entire @DNA-LLM team on their recent preprint Life as a Function: Why Transformer Architectures Struggle to Gain Genome-Level Foundational Capabilities.

It's still an early version and they plan to extend the work presented here as well as other downstream projects/applications.

Life as a Function: Why Transformer Architectures Struggle to Gain ...

Recent years have seen a flurry of generative nucleotide models, mostly of limited utility. In this paper, we use the functional representation of DNA as a complex, composite function on the plane of evolution to extend the theoretical unification of ecological and evolutionary change to the problem of synthetic DNA models. Through experiments o...

sterile cargoBOT May 14, 2025, 3:03 PM

#

@everyone

📢We're excited to announce the release of Stable Audio Open Small, now available for download on Hugging Face!📢

This is a smaller (341M parameters), more efficient version of our Stable Audio Open 1.0 model, optimized for quick inference.

To read about the new ARC post-training method we used to accelerate this model, check out our new research paper on arXiv!

We also partnered with Arm on this release to enable further optimization of the model for deployment on CPUs. You can check out their new learning path to see how you can enable fast edge deployment of this new model.

💻Weights: https://huggingface.co/stabilityai/stable-audio-open-small
📃Paper: https://arxiv.org/abs/2505.08175
🎓Arm learning path: https://learn.arm.com/learning-paths/mobile-graphics-and-gaming/run-stable-audio-open-small-with-lite-rt

stabilityai/stable-audio-open-small · Hugging Face

arXiv.org

Fast Text-to-Audio Generation with Adversarial Post-Training

Text-to-audio systems, while increasingly performant, are slow at inference time, thus making their latency unpractical for many creative applications. We present Adversarial Relativistic-Contrastive (ARC) post-training, the first adversarial acceleration algorithm for diffusion/flow models not based on distillation. While past adversarial post-...

old ventureBOT May 22, 2025, 6:53 PM

#

@everyone Let's say you're working with an AI as a co-scientist and you ask it to proofread the paper and report back on any mistakes. How likely is it to find real errors? We built SPOT, a benchmark of papers that were retracted or errata'd, to find out! SPOT comprises 83 research papers with 91 author-validated error annotations across 10 STEM fields. We categorize errors into 6 error types (equation/proof, figure duplication, data inconsistency, statistical reporting, reagent identity, and experiment setup) and include information on where in the paper the error is, whether it led to an erratum or a retraction, and an expert-human-authored description of the error.

We benchmarked 10 top models, both closed and open, and the results are sobering – the best results are from o3 which has a precision of 6% and a recall of 21%. All other models score below 4% precision and 10% recall. We also look at what happens when you run a model multiple times: across 8 trials models rarely discover the same errors and generally assign a near-zero confidence to their claims. The appendix contains breakdowns by field, error type, ablations for when figures are omitted, and more.
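For anyone who wants a concrete feel for what those precision and recall figures mean, here is a minimal sketch. The function name and the counts are hypothetical and chosen only for illustration; they are not SPOT data, and the paper may aggregate its scores differently (for example, per paper rather than pooled).

```python
def precision_recall(true_positives: int, predicted: int, annotated: int) -> tuple[float, float]:
    """Return (precision, recall) for an error-detection run.

    true_positives: model-reported errors that match an annotated error
    predicted:      total errors the model reported
    annotated:      total annotated (ground-truth) errors
    """
    precision = true_positives / predicted if predicted else 0.0
    recall = true_positives / annotated if annotated else 0.0
    return precision, recall


# Hypothetical counts for illustration only; these are not SPOT results.
p, r = precision_recall(true_positives=3, predicted=50, annotated=15)
print(f"precision={p:.0%}, recall={r:.0%}")
```

Low precision means most of what the model flags is not a real error, and low recall means most real errors go unflagged.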

This work was led by @GSON with contributions by @Honglu, @mrgonao, myself, and several others who aren't on Discord.

arXiv, Twitter thread, data
