小小林熬夜学编程

这个屌丝很懒，什么也没留下！

热门标签

这可能是最强AI文生图工具：Stable Diffusion 3 超详细测试_文生图测试方案

作者：小小林熬夜学编程 | 2024-06-03 00:14:13

踩

文生图测试方案

最近文生图领域最重要的消息，就是Stable Diffusion 3的推出。

目前，有两种使用Stable Diffusion 3的方法，一种是通过API调用，这需要在Stability AI开发者平台申请API Keys：

Stability AI开发者平台

在Google Colab上调用API进行绘图

另一种方法，是使用Stable Assistant聊天机器人（需申请），类似在ChatGPT里使用DALLE3：

通过Stable Assistant使用SD3

总之，目前两种方式都需要付费，10美元1000点数，只能画不到200张图，并不便宜。

那到底效果如何，今天就和Midjourney（V6）作一番详细对比：

美丽的魔女，黑色长发，穿着黑色高领套头衫和黑色瑜伽裤，在一个神奇的智能企鹅文明祭坛旁摆姿势，雕像，动画艺术风格，魔鬼核心，超现实插画，32k uhd，龙的艺术，燃烧的哥特式背景，超现实的人物。
Beautiful demoness with long black hair, dressed in a black turtleneck jumper and black yoga pants, posing next to a magic Intelligent Penguin Civilization altar, a statue, in the style of anime art, devil-core, hyper-realistic illustrations, 32k uhd, dragon art, flaming gothic background, hyper-realistic characters.

Stable Diffusion 3

Midjourney V6

两款工具的画风都比较精致，但MJ6没能体现“魔女”，SD3则加入了眼睛异色、头上长角的元素。

狮子肖像，黑白，逼真
lion portrait, black and white, photorealistic

Stable Diffusion 3

Midjourney V6

两款工具表现都很好，好到简直像是以同一只狮子的照片训练的。

贴纸设计的武士宫本武藏，他与樱花树站在一起，平静，安详，极简的线条插图，黑色的灰色和橙色，白色的背景
sticker design of samurai Miyamoto Musashi, he is standing with cherry blossoms tree, he is standing calm, serene, minimalistic line illustration, black gray and orange colors, white background

Stable Diffusion 3

Midjourney V6

相比之下，MJ6更符合“极简”的要求。

一堆垃圾，高得像一座山，活着的人类演员从里面伸出来，人类垃圾超写实，色彩鲜艳，现代，白色背景
pile of garbage, very tall like a mountain, alive human comedians sticking out of it, ala human garbage ultra realistic, colorful, modern, white background

Stable Diffusion 3

Midjourney V6

各有所长，但MJ6在人物的细节上面表现更好。

泰伦斯·马利克拍摄的现代电影，当代在热气球上野餐和西蒙妮·吉尔兹吃糖果的场景
Cinematic film still of modern contemporary picnic in a hot air balloon and a simone giertz eating a candy bar by terrence malik

Stable Diffusion 3

Midjourney V6

提示词包含具体的导演风格和演员形象，两款工具不相伯仲，但对于表现“吃东西”都有点困难。

**6.
**
Stable Diffusion 3

Midjourney V6

个人感觉都差不多，但MJ6的工人是白人，SD3的工人是黑人？

动作科幻电影中女主角的电影广角镜头，控制论增强，运动身体，未来主义，戏剧性的姿势，大胆的灯光，景深，引人注目的视觉效果
cinematic wide shot of a lead heroine in an action packed sci fi movie with cybernetic enhancements, athletic body, futuristic, dramatic pose, bold lighting, depth of field, striking visual effects

Stable Diffusion 3

Midjourney V6

都说MJ6的优势是艺术效果，但这次似乎是SD3的质感更佳。

1961年完全库存道奇动力旅行车展出在道奇经销商船回到1961年的质量照片质量
'61 completely stock Dodge Power Wagon on display at the dodge dealer ship back in the day, 1961 quality photo quality

Stable Diffusion 3

Midjourney V6

SD3在文字的还原上比较强。

一个全副武装的骑士坐在欧洲拥挤的地铁里，逼真的，用iPhone 13拍摄
a knight in full armor sitting in a crowded subway in europe, photorealistic, shot with an iphone 13

Stable Diffusion 3

Midjourney V6

两款工具的效果都令人满意。

感兴趣的小伙伴，赠送全套AIGC学习资料，包含AI绘画、AI人工智能等前沿科技教程和软件工具，具体看这里。

10.

一个被锁住的男人，拿着一把鹤嘴锄，在一个建筑工地上工作，他们正在建造未来主义的摩天大楼，这个男人站在那里，对着天空大喊大叫，地上有一片面包，照片，现实主义的照片，肖像
a chained man, holding a pickaxe, working on a construction site where they are building futuristic skyscrapers, the man is standing and shouting to the sky, a piece of bread on the ground, photo, realistic photo, portrait

Stable Diffusion 3

Midjourney V6

MJ6似乎无法很好地理解“被锁住”和“鹤嘴锄”，面包的数量也不对。SD3胜出。

11.

一个黑色的和尚坐在通往无边无际的星星的门口静静地冥想
a black monk in tranquil meditation sitting at the gateway to an infinity of endless stars

Stable Diffusion 3

Midjourney V6

很难说哪个更好，取决于用户的自身审美。

12.

设计师高跟鞋的特写照片，在一个令人难以置信的美丽豪宅和庄园修剪整齐的草坪上，美丽的日落照亮了天空和云彩。
close up photo of designer high heels being worn in the manicured grass lawn of an incredibly beautiful mansion and estate as a gorgeous sunset lights the sky and clouds.

Stable Diffusion 3

Midjourney V6

两款工具都有不错的表现，但需要经过“抽卡”。

13.

一位身着橙青色渐变荷叶裙的中国古代美女，化着华丽的妆容，戴着精致的发饰，站在以荷塘为背景的唐代建筑花园中。她头上戴着荷叶，穿着五颜六色、飘逸的衣袖。全身照片是用那个时代风格的高清摄影拍摄的。她举着一个牌子，上面写着“我是中国人”。
An ancient Chinese beauty wearing an orange and cyan gradient lotus leaf skirt, gorgeous makeup, and exquisite hair accessories stood in the garden of Tang Dynasty architecture with a lotus pond background. Lotus leaves were on her head and she wore colorful, flowing sleeves. The full body photos were taken with high-definition photography in the style of that period. She was holding a sign that said “I am Chinese”.

Stable Diffusion 3

Midjourney V6

妆容表现力差不多，MJ6生成的都是全景，SD3生成的则是近景。文字生成的准确性方面，SD3再次胜出。

14.

一座雕刻在山上的城堡入口的概念艺术，有一座桥与之相连
concept art of an entrance of a castle carved into a mountain, with a bridge connecting to it

Stable Diffusion 3

Midjourney V6

两款工具都较好地达成了创作要求。

15.

一个巨大的种植基地，蓝天，一个巨大的果园，一个小的绿色柑橘，极美的风景照片，风景摄影作品，广角拍摄，逼真的视觉效果，逼真的风景照片，超现实主义，丰富明亮的光源，美丽的光影，柔和的光源，低角度摄影，鸟瞰，高质量，高细节，8k
A 10000 acre planting base, a huge planting base, blue sky and a huge planting base, a huge orchard, small green citrus, extremely beautiful scenery photos, landscape photography works, wide-angle shooting, realistic visual effects, realistic landscape photos, surrealism, rich and bright light sources, beautiful light and shadow, soft light sources, low angle photography, bird’s-eye view, high quality, high detail, 8k

Stable Diffusion 3

Midjourney V6

MJ6的场景较为自然，SD3的植物排列有点过于整齐

16.

日本女团，年轻的少女，抚摸不同人的头发，直发，快乐的表情，浅色照片，肖像，时尚照片，50mm镜头，4k
Japanese girl group, young teenage women, touching different people’s hair, straight hair, happy expressions, light colour photo, portrait, fashion photo, 50mm lens, 4k

Stable Diffusion 3

Midjourney V6

SD3的图中规中矩，MJ6的“艺术修养”这次体现出来了。

17.

街头移动咖啡厅，木质框架，透明玻璃，现代简约，空间多变，结构简洁，细节精致，建筑艺术
Street mobile cafe, wooden frame, transparent glass, modern simplicity, variable space, simple structure, exquisite details, architectural art

Stable Diffusion 3

Midjourney V6

感觉MJ6的生成效果更自然。

18.

霓虹灯在白色的背景上使用粉红色、白色、浅蓝色、深蓝色
neon lights circle on white background using colours pink, white, light blue, dark blue

Stable Diffusion 3

Midjourney V6

虽然呈现的画面有区别，但都满足了提示词的要求。

19.

炫酷蜘蛛侠的“射网”姿势
Cool Spider-Man’s “web shooter” pose

Stable Diffusion 3

Midjourney V6

SD3的服装质感更好。

感兴趣的小伙伴，赠送全套AIGC学习资料，包含AI绘画、AI人工智能等前沿科技教程和软件工具，具体看这里。

20.

可爱的皮克斯《哈利波特》海报，迪士尼logo上有“哈利波特”字样的文字，哈利波特手持魔杖，指向前方，进入战斗位置，大大的眼睛，没有雀斑，背景是霍格沃茨学校的城堡，一些巫师骑着扫帚，皮克斯风格，色彩鲜艳
Cute Pixar poster of Harry Potter, Disney logo with the words “Harry Potter” in the text, Harry Potter holding a magic wand, Point ahead and get into battle position, big eyes, No freckles, In the background are Hogwarts School castle, Some wizards ride broomsticks, Pixar style, bright colors

Stable Diffusion 3

Midjourney V6

两款工具表现都不完美，MJ6的英文有问题，SD3的手指有问题。

至此，可以对Stable Diffusion 3给出一个初步的结论：

1、对提示词的理解（跟随度）比之前版本有了明显进步，但尚未达到DALLE3的程度。

2、表现文字（英文）的能力，比Midjourney更强。

3、审美能力（美观度）略逊于Midjourney。

4、人体（尤其手指）较容易崩坏，相信开源后可借助插件解决。

5、目前功能较简单，并不支持局部重绘等

综合来看，SD3具有相当大的潜力。尤其Stability AI承诺会坚持开放原则，在不久的将来使得SD3模型可以本地部署。

基于目前的模型，Stable Diffusion如果接入各种插件，成为最强文生图工具，并不是梦。尤其是…本地部署还有更多的“创作自由”！

写在最后

感兴趣的小伙伴，赠送全套AIGC学习资料，包含AI绘画、AI人工智能等前沿科技教程和软件工具，具体看这里。

AIGC技术的未来发展前景广阔，随着人工智能技术的不断发展，AIGC技术也将不断提高。未来，AIGC技术将在游戏和计算领域得到更广泛的应用，使游戏和计算系统具有更高效、更智能、更灵活的特性。同时，AIGC技术也将与人工智能技术紧密结合，在更多的领域得到广泛应用，对程序员来说影响至关重要。未来，AIGC技术将继续得到提高，同时也将与人工智能技术紧密结合，在更多的领域得到广泛应用。

在这里插入图片描述