Home
Drive ChatGPT and Gemini from Python — no API keys, no billing, just the free web UI.
ChatGPT and Gemini are incredibly capable — but their official APIs are expensive, and for many tasks you simply don't need them. If you want to run OCR on an image, generate artwork, extract text from a screenshot, or just ask a quick question in a script, paying per-token for API access is overkill when the free web UI can do the same thing.
Hermex lets you automate ChatGPT and Gemini with Python — no API keys, no billing, no paywalls. It opens a real Chrome browser, types your message, uploads your files, waits for the response, and hands it back to you as a Python object, just like a human would.
from hermex import ChatGPT
response = ChatGPT.simple_query(
"What does this receipt say?",
attachments=["receipt.jpg"]
)
print(response.text)
Why Hermex?¶
- No API keys — uses the same free web UI you already have access to
- File support — upload images, PDFs, CSVs, text files, and more; download AI-generated images
- Bot detection evasion — built on
undetected-chromedriverwith simulated human typing - Persistent sessions — log in once, reuse the session across all future runs
- Fluent interface — chain method calls for clean, readable scripts