ComfyUI GeminiOllama Extension

This extension integrates Google's Gemini API, Ollama, and various image processing tools into ComfyUI, allowing users to leverage these powerful models and features directly within their ComfyUI workflows.

Features

Support for Gemini and Ollama APIs
Text and image input capabilities
Streaming option for real-time responses
FLUX Resolution tools for image sizing
ComfyUI Styler for advanced styling options
Raster to Vector (SVG) conversion
Text splitting and processing
Easy integration with ComfyUI workflows

Nodes

1. Gemini API

The Gemini API node allows you to interact with Google's Gemini models:

Text input field for prompts
Model selection:
- gemini-1.5-pro-latest
- gemini-1.5-pro-exp-0801
- gemini-1.5-flash
Streaming option for real-time responses

2. Ollama API

Integrate local language models running via Ollama:

Text input field for prompts
Dropdown for selecting Ollama models
Customizable model options

3. FLUX Resolutions

Provides advanced image resolution and sizing options:

Predefined resolution presets (e.g., 768x1024, 1024x768, 1152x768)
Custom sizing parameters:
- size_selected
- multiply_factor
- manual_width
- manual_height

4. ComfyUI Styler

Extensive styling options for various creative needs:

🎨 General Arts – A broad spectrum of traditional and modern art styles 🌸 Anime – Bring your designs to life with anime-inspired aesthetics 🎨 Artist – Channel the influence of world-class artists 📷 Camera – Fine-tune focal lengths, angles, and setups 📐 Camera Angles – Add dynamic perspectives with a range of angles 🌟 Aesthetic – Define unique artistic vibes and styles 🎞️ Color Grading – Achieve rich cinematic tones and palettes 🎬 Movies – Get inspired by different cinematic worlds 🖌️ Digital Artform – From vector art to abstract digital styles 💪 Body Type – Customize different body shapes and dimensions 😲 Reactions – Capture authentic emotional expressions 💭 Feelings – Set the emotional tone for each creation 📸 Photographers – Infuse the style of renowned photographers 💇 Hair Style – Wide variety of hair designs for your characters 🏛️ Architecture Style – Classical to modern architectural themes 🛠️ Architect – Designs inspired by notable architects 🚗 Vehicle – Add cars, planes, or futuristic transportation 🕺 Poses – Customize dynamic body positions 🔬 Science – Add futuristic, scientific elements 👗 Clothing State – Define the wear and tear of clothing 👠 Clothing Style – Wide range of fashion styles 🎨 Composition – Control the layout and arrangement of elements 📏 Depth – Add dimensionality and focus to your scenes 🌍 Environment – From nature to urban settings, create rich backdrops 😊 Face – Customize facial expressions and emotions 🦄 Fantasy – Bring magical and surreal elements into your visuals 🎃 Filter – Apply unique visual filters for artistic effects 🖤 Gothic – Channel dark, mysterious, and dramatic themes 👻 Halloween – Get spooky with Halloween-inspired designs ✏️ Line Art – Incorporate clean, bold lines into your creations 💡 Lighting – Set the mood with dramatic lighting effects ✈️ Milehigh – Capture the essence of aviation and travel 🎭 Mood – Set the emotional tone and atmosphere 🎞️ Movie Poster – Create dramatic, story-driven poster designs 🎸 Punk – Channel bold, rebellious aesthetics 🌍 Travel Poster – Design vintage travel posters with global vibes

5. Raster to Vector (SVG) and Save SVG

Convert raster images to vector graphics and save them:

Raster to Vector node parameters:

colormode
filter_speckle
corner_threshold
... (and more)

Save SVG node options:

filename_prefix
overwrite_existing

6. TextSplitByDelimiter

Split text based on specified delimiters:

Input text field
Delimiter options:
- split_regex
- split_every
- split_count

Installation

Clone this repository into your ComfyUI's custom_nodes directory:

cd /path/to/ComfyUI/custom_nodes
git clone https://github.com/yourusername/GeminiOllama.git

Install the required dependencies:

pip install google-generativeai requests vtracer

Configuration

Gemini API Key Setup

Go to the Google AI Studio.
Create a new API key or use an existing one.
Copy the API key.
Create a config.json file in the extension directory with the following content:
```
{
  "GEMINI_API_KEY": "your_api_key_here"
}
```

Ollama Setup

Install Ollama by following the instructions on the Ollama GitHub page.
Start the Ollama server (usually runs on http://localhost:11434).

Add the Ollama URL to your config.json:

{
  "GEMINI_API_KEY": "your_api_key_here",
  "OLLAMA_URL": "http://localhost:11434"
}

Usage

After installation and configuration, a new node called "Gemini Ollama API" will be available in ComfyUI.

Input Parameters

api_choice: Choose between "Gemini" and "Ollama"
prompt: The text prompt for the AI model
gemini_model: Select the Gemini model (for Gemini API)
ollama_model: Specify the Ollama model (for Ollama API)
stream: Enable/disable streaming responses
image (optional): Input image for vision-based tasks

Output

text: The generated response from the chosen AI model

Main Functions

get_gemini_api_key(): Retrieves the Gemini API key from the config file.
get_ollama_url(): Gets the Ollama URL from the config file.
generate_content(): Main function to generate content based on the chosen API and parameters.
generate_gemini_content(): Handles content generation for Gemini API.
generate_ollama_content(): Manages content generation for Ollama API.
tensor_to_image(): Converts a tensor to a PIL Image for vision-based tasks.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
RMBG-1.4		RMBG-1.4
data		data
.gitignore		.gitignore
BRIA_RMBG.py		BRIA_RMBG.py
ComfyUI_GeminiOllama_Extension_README.md		ComfyUI_GeminiOllama_Extension_README.md
FLUXResolutions.py		FLUXResolutions.py
GeminiOllamaNode.py		GeminiOllamaNode.py
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
briarmbg.py		briarmbg.py
config.json		config.json
prompt_styler.py		prompt_styler.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
sizes.json		sizes.json
svgnode.py		svgnode.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ComfyUI GeminiOllama Extension

Features

Nodes

1. Gemini API

2. Ollama API

3. FLUX Resolutions

4. ComfyUI Styler

5. Raster to Vector (SVG) and Save SVG

6. TextSplitByDelimiter

Installation

Configuration

Gemini API Key Setup

Ollama Setup

Usage

Input Parameters

Output

Main Functions

Contributing

License

About

Releases

Packages

Contributors 3

Languages

License

al-swaiti/ComfyUI-OllamaGemini

Folders and files

Latest commit

History

Repository files navigation

ComfyUI GeminiOllama Extension

Features

Nodes

1. Gemini API

2. Ollama API

3. FLUX Resolutions

4. ComfyUI Styler

5. Raster to Vector (SVG) and Save SVG

6. TextSplitByDelimiter

Installation

Configuration

Gemini API Key Setup

Ollama Setup

Usage

Input Parameters

Output

Main Functions

Contributing

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages