Experimental Anima LLLite Regional Controlnet

Apply Anima ControlNet-LLLite parameters: (https://github.com/kohya-ss/ComfyUI-Anima-LLLite)

Region Color Mask + Output image: (Use basic colors for different region)

Usage

Use a color mask image as the conditioning input.
Any color can be used to define a region. There is no fixed palette or strict RGB requirement.
The mask background should be white.
Using simple solid colors (red, green, blue, yellow, etc.) for different regions is recommended for clarity.
The model was trained using manually masked conditioning images and therefore expects clearly separated regions.

Prompting

Normal prompting works fine.

However, the model currently cannot determine which prompt corresponds to which colored region automatically. It only receives the region mask as additional conditioning and does not perform explicit prompt-to-region matching.

To guide concepts into specific regions, use spatial prompts such as:

"girl on the left, cat on the right"
"character in the foreground, city skyline in the background"

Combining this model with attention masking methods such as Forge Couple or the Attention Couple node can provide stronger prompt-to-region associations.

Training

Trained on 580 images, 2 repeats, batch size 4, for 4400 steps
Conditioning images were manually masked for each image
Captions were generated with the help of wd-eva02-large-tagger v3 and https://github.com/pythongosssss/ComfyUI-WD14-Tagger custom node
Trained using https://github.com/kohya-ss/sd-scripts

Limitations

Because most training images consisted of close-up character compositions, generations involving distant subjects may not strictly adhere to the provided mask boundaries.

Downloads last month: -

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Sen-sou/Anima-LLLite-Regional-Controlnet

Base model

nvidia/Cosmos-Predict2-2B-Text2Image

Finetuned

circlestone-labs/Anima

Adapter

(39)

this model