File:X-Y plot of algorithmically-generated AI art demonstrating Hypernetworks.png
原始檔案 (5,952 × 3,197 像素,檔案大小:22.01 MB,MIME 類型:image/png)
摘要
描述X-Y plot of algorithmically-generated AI art demonstrating Hypernetworks.png |
An X/Y plot of algorithmically-generated AI artworks depicting a woman in various different settings, created using a custom-trained anime-focused Stable Diffusion-based model known as "Anything V3.0" (with hash 1a7df6b8) created by Furqanil Taqwa. This plot serves to demonstrate the usage of Hypernetworks, a technique created by Kurumuz in 2021 which allows Stable Diffusion-based image generation models to imitate the art style of specific artists, even if the artist is not recognised by the original diffusion model, by applying a small neural network at various points within the larger network. Hypernetworks are small pre-trained neural networks that steer results towards a particular direction, for example applying visual styles and motifs, when used in conjunction with a larger neural network. The Hypernetwork processes the image by finding key areas of importance such as hair and eyes, and patches them in secondary latent space. They are significantly smaller in filesize compared to DreamBooth models, another method for fine-training AI diffusion models, making Hypernetworks a viable alternative to DreamBooth models in some, but not all, use-cases. Hypernetwork training also requires only 6GB of VRAM, compared to the ~20GB VRAM required for DreamBooth training (although this VRAM requirement can be lowered using DeepSpeed). A downside to Hypernetworks is that they are comparatively less flexible and accurate, and can sometimes lead to unpredictable results. For this reason, Hypernetworks are suited towards applying visual style or cleaning up blemishes in human anatomy, while DreamBooth models are more adept at depicting specific user-defined subjects.
These images were generated using an NVIDIA RTX 4090; since Ada Lovelace chipsets (using compute capability 8.9, which requires CUDA 11.8) are not fully supported by the pyTorch dependency libraries currently used by Stable Diffusion, I've used a custom build of xformers, along with pyTorch cu116 and cuDNN v8.6, as a temporary workaround. Front-end used for the entire generation process is Stable Diffusion web UI created by AUTOMATIC1111. Hypernetworks trained on the artstyles of the following artists were used:
A batch of 768x1024 images were generated with txt2img using the following prompts:
During the generation of this batch, an X/Y plot was generated using the "X/Y plot" txt2img script, along with the following settings:
|
日期 | |
來源 | 自己的作品 |
作者 | Benlisquare |
授權許可 (重用此檔案) |
As the creator of the output images, I release this image under the licence displayed within the template below.
The Stable Diffusion AI model is released under the CreativeML OpenRAIL-M License, which "does not impose any restrictions on reuse, distribution, commercialization, adaptation" as long as the model is not being intentionally used to cause harm to individuals, for instance, to deliberately mislead or deceive, and the authors of the AI models claim no rights over any image outputs generated, as stipulated by the license.
Anything V3.0, created by Furqanil Taqwa, is released under the CreativeML OpenRAIL-M License.
|
File history includes unedited version(s) The upload history for this file includes one or more unedited versions of the file uploaded shortly before the final version was uploaded. The unedited versions are not intended to be used independently, and should not be split out as separate files unless this is needed for a specific known use. |
授權條款
- 您可以自由:
- 分享 – 複製、發佈和傳播本作品
- 重新修改 – 創作演繹作品
- 惟需遵照下列條件:
- 姓名標示 – 您必須指名出正確的製作者,和提供授權條款的連結,以及表示是否有對內容上做出變更。您可以用任何合理的方式來行動,但不得以任何方式表明授權條款是對您許可或是由您所使用。
- 相同方式分享 – 如果您利用本素材進行再混合、轉換或創作,您必須基於如同原先的相同或兼容的條款,來分布您的貢獻成品。
已授權您依據自由軟體基金會發行的無固定段落、封面文字和封底文字GNU自由文件授權條款1.2版或任意後續版本,對本檔進行複製、傳播和/或修改。該協議的副本列在GNU自由文件授權條款中。http://www.gnu.org/copyleft/fdl.htmlGFDLGNU Free Documentation Licensetruetrue |
在此檔案描寫的項目
描繪內容
沒有維基數據項目的某些值
GNU自由文檔許可證1.2或更高版本 繁體中文 (已轉換拼寫)
3 12 2022
多媒體型式 繁體中文 (已轉換拼寫)
image/png
檔案歷史
點選日期/時間以檢視該時間的檔案版本。
日期/時間 | 縮圖 | 尺寸 | 用戶 | 備註 | |
---|---|---|---|---|---|
目前 | 2022年12月4日 (日) 17:05 | 5,952 × 3,197(22.01 MB) | Benlisquare | inpaint ugly hands: done | |
2022年12月4日 (日) 14:13 | 5,952 × 3,197(21.89 MB) | Benlisquare | inpaint ugly hands WIP | ||
2022年12月4日 (日) 01:19 | 5,952 × 3,197(21.82 MB) | Benlisquare | inpaint ugly hands WIP (this process takes hours, will finish later) | ||
2022年12月3日 (六) 22:20 | 5,952 × 3,197(21.82 MB) | Benlisquare | {{Information |Description= An X/Y plot of algorithmically-generated AI artworks depicting a woman in various different settings, created using a custom-trained anime-focused Stable Diffusion-based model known as "[https://huggingface.co/Linaqruf/anything-v3.0 Anything V3.0]" (with hash 1a7df6b8) created by [https://huggingface.co/Linaqruf Furqanil Taqwa]. This plot serves to demonstrate the usage of Hypernetworks, a [https://blog.novelai.net/novelai-improvements-on-stable-diffusion-e10d38db8... |
檔案用途
下列2個頁面有用到此檔案:
詮釋資料
此檔案中包含擴展的資訊。這些資訊可能是由數位相機或掃描器在建立時或數位化過程中所加入。
如果此檔案的來源檔案已被修改,一些資訊在修改後的檔案中將不能完全反映出來。
水平解析度 | 28.35 dpc |
---|---|
垂直解析度 | 28.35 dpc |
檔案修改日期時間 | 2022年12月4日 (日) 17:02 |