Stable Diffusion 3 Medium ファインチューニング ガイド

SD3 でこのような画像を生成してみたいと思ったことはありますか?

Prompt

a three fourth perspective portrait view of a young woman with messy blonde hair and light purple eyes, looking at viewer with a closed mouth smile, slightly visible right pointy fantasy ear, wearing a black feather hair tie on right side of hair, wearing a pink feather above right ear, wearing silver earrings, wearing baggy white collared shirt with a black cloak wrapped around shoulders, bright yellow rim light hitting left side of face, cropped, a faded pink simple background during golden hour

Prompt

A person stands in the foreground with their back turned to the camera, appearing to be about to enter a doorway. They have short hair and are dressed in casual clothing. The background features a misty, dimly-lit street lined with cars, old building facades, and a brightly lit gas station sign that reads "iperoil" with prices 1.775 and 1.699 visible. The style conveys a gritty, realistic urban environment, highlighted by the vintage design of the gas station sign. The scene appears to be set late at night or early dawn, with moody, greenish lighting shrouded in fog, giving a sense of quiet solitude and contemplation.

Prompt

a front wide view of a small cyberpunk city with futuristic skyscrapers with gold rooftops situated on the side of a cliff overlooking an ocean, day time view with green tones, some boats floating in the foreground on top of reflective orange water, large mechanical robot structure reaching high above the clouds in the far background, atmospheric perspective, teal sky

こんにちは!私はStability AIのGenerative Media Solutions Engineer(およびフリーランスの2D/3Dコンセプトデザイナー)のYeo Wangです。YouTubeで動画を見たことがある方や、コミュニティ(Github)でご存知の方もいらっしゃるかもしれません。

今回、SD3 Medium をトレーニングするときの良い結果を得ることができたので、その洞察と、フルファインチューニングおよび LoRA トレーニングのクイックスタート設定をご紹介します。

興味があるエンジニア、または技術の知識のある方、ファインチューニングについての基本的な知識のある方はぜひこちらをご覧ください。

また、次期画像モデルのプレビューもお見逃しなく!

Next
Next

Lenovo Creator Zone にStability AI の画像生成モデルが登場