	---
	license: cc-by-nc-4.0
	language:
	- en
	base_model:
	- MidnightRunner/MDNT_Illus
	tags:
	- SDXL
	- text-to-image
	- ComfyUI
	- Automatic1111
	- Diffuser
	- educational
	- sign-language
	- mascot
	pipeline_tag: text-to-image
	library_name: diffusers
	metrics:
	- FID
	- IS
	widget:
	- text: >-
	    (high-quality cartoon illustration), (vibrant colors), (smooth gradients),
	    (polished details), (child-friendly design), aslbeaver, asl_signing
	  parameters:
	    negative_prompt: (ugly, worst quality, extra limbs, lowres, watermark)
	  output:
	    url: aslbeaver_e000002_01_20250415205323.png
	---

	# Model Card for aslbeaver

	The `aslbeaver` LoRA was developed by [midnight1111](https://civitai.com/user/midnight1111) to generate a cartoon beaver mascot designed to demonstrate ASL (American Sign Language) gestures in educational game environments. The model supports expressions like "phone_on_forehead" for gesture guessing games and animated classroom scenes.

	## Model Details

	### Model Description

	Model Type: Diffusion-based text-to-image LoRA model built on SDXL

	Checkpoint Compatibility: This LoRA works best with SDXL checkpoints such as Illustrious-XL and is compatible with ComfyUI and Automatic1111.

	Style & Focus: Designed for expressive cartoon generation focused on educational content, particularly American Sign Language (ASL) gesture storytelling.


	- Model Name: aslbeaver
	- Developed by: [midnight1111](https://civitai.com/user/midnight1111)
	- Finetuned from: https://civitai.com/models/981618?modelVersionId=1405366
	- Base Model: https://huggingface.co/MidnightRunner/MDNT_Illus
	- Model Type: LoRA (Low-Rank Adaptation)
	- License: CC BY-NC 4.0

	### Model Sources

	- Demo Use: https://www.playwithasl.com
	- Repository: https://huggingface.co/MidnightRunner/aslbeaver

	## Prompt Format

	### Positive Prompt Example

	aslbeaver, phone_on_forehead, gaming_mode, cheerful expression, bright classroom, tail behind

	### Negative Prompt Example

	lowres, worst quality, low quality, bad anatomy, bad hands, 4koma, comic, greyscale, censored, jpeg artifacts, overly saturated, overly vivid, (multiple views:1.1), (bad:1.05), fewer, extra, missing, bad quality, watermark, unfinished, displeasing, sepia, sketch, flat color, signature, artistic error, username, scan, blurry, (low quality:1.1), ugly, (bad anatomy:1.05), artist name, (patreon username:1.2), elongated torso, anime blush

	## Settings

	- Sampling Methods: Euler A, DPM++ 2M Karras, DPM++ 3M SDE
	- Sampling Steps: 25–30
	- CFG Scale: 5.0
	- Clip Skip: 1

	## Uses

	### Direct Use
	- Generate cartoon mascot images for ASL guessing games or classroom visuals.

	### Downstream Use
	- Educational apps, gesture-based gameplay designs, comics, posters, signage.

	## Training Details
	````yaml
	engine: kohya
	resolution: 1024
	epochs: 13
	batch_size: 6
	repeats_per_image: 9
	bucket_enabled: true
	scheduler:
	  type: cosine_with_restarts
	  cycles: 3

	optimizer:
	  type: AdamW8Bit

	learning_rate:
	  unet: 0.0005
	  text_encoder: 0.00005

	network:
	  dim: 32
	  alpha: 16

	precision:
	  use_fp16: true
	noise_offset: 0.1
	min_snr_gamma: 5

	clip_skip: 1
	keep_tokens: 1
	shuffle_caption: true
	````
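
As a rough illustration of how these settings combine, the sketch below estimates the number of optimizer steps for a run. The dataset size (`num_images = 40`) is a hypothetical placeholder, since this card does not state it. The formula assumes that each image is seen `repeats_per_image` times per epoch and ignores any remainder effects from bucketing. It also computes the `alpha / dim` ratio, which kohya-style trainers apply as a scale on the LoRA update.

```python
import math


def estimate_steps(num_images, repeats, epochs, batch_size):
    """Approximate optimizer steps: ceil(images * repeats / batch) per epoch."""
    steps_per_epoch = math.ceil(num_images * repeats / batch_size)
    return steps_per_epoch * epochs


num_images = 40  # hypothetical dataset size, not stated in this card
total_steps = estimate_steps(num_images, repeats=9, epochs=13, batch_size=6)

# network.alpha / network.dim from the config above
lora_scale = 16 / 32

print(f"estimated steps: {total_steps}, LoRA scale: {lora_scale}")
```

Substitute your own image count to get an estimate for a comparable run.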

	### Inference Sample (Diffusers)

	````python
	from diffusers import StableDiffusionXLPipeline
	import torch

	# Load the base SDXL checkpoint (a Diffusers-format folder or Hub repo ID)
	base_model = "path_to_your_sdxl_or_checkpoint"

	pipe = StableDiffusionXLPipeline.from_pretrained(base_model, torch_dtype=torch.float16)
	pipe.to("cuda")

	# Load the aslbeaver LoRA from the current directory
	pipe.load_lora_weights(".", weight_name="aslbeaver.safetensors")

	# Define prompts
	prompt = "aslbeaver, phone_on_forehead, gaming_mode, cheerful expression, bright classroom, tail behind"
	negative_prompt = "(ugly, worst quality, extra limbs, lowres, watermark)"

	# Run the pipeline with the LoRA applied at 0.7 strength,
	# using the steps and CFG scale recommended under Settings
	result = pipe(
	    prompt=prompt,
	    negative_prompt=negative_prompt,
	    num_inference_steps=28,
	    guidance_scale=5.0,
	    cross_attention_kwargs={"scale": 0.7},
	).images[0]
	result.save("aslbeaver_result.png")
	````


	## How to Use in ComfyUI

	1. Load your base SDXL checkpoint in a `CheckpointLoaderSimple` or similar node.
	2. Use a `LoraLoader` or `LoraStack` node to load `aslbeaver.safetensors`.
	   - Set `strength_model` to around `0.8`.
	   - Set `strength_clip` to `1.0` for full text embedding effect.
	3. Use a standard SDXL pipeline with `Positive Prompt`, `Negative Prompt`, and Sampler setup as needed.
	4. Render as usual.
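
For scripted workflows, the loader steps above can be sketched as a fragment of a ComfyUI API-format workflow. This is a minimal sketch, not an exported workflow: the node IDs (`"1"`, `"2"`) are arbitrary, the checkpoint filename is a hypothetical placeholder, and the prompt-encoding and sampler nodes are omitted. The input names follow the stock `CheckpointLoaderSimple` and `LoraLoader` nodes as I understand them, so verify them against your ComfyUI install.

```python
import json


def build_lora_nodes(checkpoint_name, lora_name="aslbeaver.safetensors",
                     strength_model=0.8, strength_clip=1.0):
    """Return the loader part of a ComfyUI API-format workflow as a dict."""
    return {
        "1": {
            "class_type": "CheckpointLoaderSimple",
            "inputs": {"ckpt_name": checkpoint_name},
        },
        "2": {
            "class_type": "LoraLoader",
            "inputs": {
                "lora_name": lora_name,
                "strength_model": strength_model,
                "strength_clip": strength_clip,
                "model": ["1", 0],  # MODEL output of node "1"
                "clip": ["1", 1],   # CLIP output of node "1"
            },
        },
    }


# Hypothetical checkpoint filename; use the one in your models folder
nodes = build_lora_nodes("your_sdxl_checkpoint.safetensors")
print(json.dumps(nodes, indent=2))
```

Downstream nodes would then take their model and CLIP inputs from node `"2"` rather than from the checkpoint loader.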

	---

	## How to Use in Automatic1111 (A1111)

	1. Copy `aslbeaver.safetensors` into:
	```
	/models/Lora/
	```
	2. In the txt2img or img2img interface, enter the LoRA trigger:
	```
	<lora:aslbeaver:0.8>
	```
	3. Combine with your prompt:
	```
	aslbeaver, phone_on_forehead, gaming_mode, cheerful expression, bright classroom, tail behind
	```
	4. Add your negative prompt and adjust CFG, steps, etc.
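
If you drive A1111 from a script, the same steps can be expressed as a request body for its `txt2img` API (available when the web UI is started with `--api`). The sketch below only builds the JSON payload and does not send it. The helper name `build_txt2img_payload` is hypothetical, the field names reflect my understanding of the `/sdapi/v1/txt2img` endpoint, and the 1024x1024 size is an assumption based on the training resolution, so check the `/docs` page of your install before relying on it.

```python
import json


def build_txt2img_payload(prompt, negative_prompt, lora_name="aslbeaver",
                          lora_weight=0.8, steps=28, cfg_scale=5.0,
                          sampler_name="Euler a"):
    """Compose a txt2img request body with the LoRA tag prepended to the prompt."""
    lora_tag = f"<lora:{lora_name}:{lora_weight}>"
    return {
        "prompt": f"{lora_tag} {prompt}",
        "negative_prompt": negative_prompt,
        "steps": steps,            # within the recommended 25-30 range
        "cfg_scale": cfg_scale,    # recommended CFG scale
        "sampler_name": sampler_name,
        "width": 1024,             # assumed from the training resolution
        "height": 1024,
    }


payload = build_txt2img_payload(
    prompt="aslbeaver, phone_on_forehead, gaming_mode, cheerful expression, "
           "bright classroom, tail behind",
    negative_prompt="ugly, worst quality, extra limbs, lowres, watermark",
)
print(json.dumps(payload, indent=2))
```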

	---

	## Evaluation

	- Evaluated visually across all 13 epochs
	- Best results achieved at epochs 11–13 for styling clarity

	## Wildcard System: `beaver_wildcards.zip`

	This LoRA comes with a complete wildcard tagging system to support flexible, randomized prompt construction. You can use this in both A1111 and ComfyUI environments.

	### Included Wildcard Files

	- `asl_signing.txt`
	- `background_scene.txt`
	- `body_proportions.txt`
	- `clothing.txt`
	- `core_traits.txt`
	- `emotion_tags.txt`
	- `expression_blocks.txt`
	- `expression_pose.txt`
	- `gesture_tags.txt`
	- `purpose_context.txt`
	- `style_rendering.txt`

	These files contain theme-specific wildcard entries for generating prompts in a modular, randomized fashion. For example, `gesture_tags.txt` includes ASL-style movements like "wave," "tap head," or "cover mouth."

	### Example Prompt Syntax
	```txt
	aslbeaver, __gesture_tags__, __clothing__, __emotion_tags__, phone_on_forehead
	```
	This syntax will randomly sample one entry from each wildcard file referenced.
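
To make the sampling behavior concrete, here is a minimal Python sketch of what a wildcard expander does with such a template. It is an illustration only, not the code used by either extension. The `expand` helper is hypothetical, and the `clothing` and `emotion_tags` entries are invented placeholders; the real lists live in the `.txt` files.

```python
import random
import re

# Matches tokens such as __gesture_tags__ or __beaver_wildcards/clothing__
WILDCARD = re.compile(r"__([A-Za-z0-9_/\-]+?)__")


def expand(prompt, wildcards, rng=None):
    """Replace each __name__ token with one randomly chosen entry."""
    rng = rng or random.Random()

    def pick(match):
        options = wildcards.get(match.group(1))
        if not options:
            return match.group(0)  # leave unknown wildcards untouched
        return rng.choice(options)

    return WILDCARD.sub(pick, prompt)


wildcards = {
    "gesture_tags": ["wave", "tap head", "cover mouth"],
    "clothing": ["blue scarf", "red cap"],    # placeholder entries
    "emotion_tags": ["cheerful", "curious"],  # placeholder entries
}
template = ("aslbeaver, __gesture_tags__, __clothing__, "
            "__emotion_tags__, phone_on_forehead")

# A seeded generator makes the sampled prompt reproducible
print(expand(template, wildcards, random.Random(0)))
```

Each call with an unseeded generator yields a different combination, which is what produces variation across a batch.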

	### Using in ComfyUI
	- Install: [`ComfyUI-Impact-Pack`](https://github.com/ltdrdata/ComfyUI-Impact-Pack)
	- Path: `/ComfyUI/custom_nodes/ComfyUI-Impact-Pack/custom_wildcards/beaver_wildcards/`
	- Place the `.txt` files there and use the `Wildcard` node to insert wildcards with the `__beaver_wildcards/filename__` pattern.

	### Using in AUTOMATIC1111 (A1111)
	- Install: [`stable-diffusion-webui-wildcards`](https://github.com/AUTOMATIC1111/stable-diffusion-webui-wildcards)
	- Path: `/extensions/stable-diffusion-webui-wildcards/wildcards/beaver_wildcards/`
	- Then reference wildcards as `__beaver_wildcards/filename__` inside your prompt text field.

	Wildcards help automate image variation for testing, prompt diversity, and batch simulation.

	## Environmental Impact

	- Hardware: 1x RTX 3090 (24GB VRAM)
	- Runtime: ~12 hours
	- Emissions: Low

	## Model Card Authors

	[midnight1111](https://civitai.com/user/midnight1111)

	## Model Card Contact

	For support, inquiries, or demo collaboration:
	- 📧 [email protected]
	- 🌐 [PlayWithASL.com](https://www.playwithasl.com)
	- 🧠 [Midnight on Civitai](https://civitai.com/user/midnight1111)

	---

	This model is intended to assist in visual communication and cartoon asset creation. Always validate generated ASL signs before public or instructional use.

	2025 © Midnight1111 / MidnightRunner. Licensed under CC BY-NC 4.0.