About Rewind.ai

400+ AI tools, 400+ models and 100+ languages, all in one place. No credit card required to start.

You should be able to run AI tools and see real results before you decide to pay for anything. That is the whole premise. We built the infrastructure to make it possible at no cost to you.

Why We Built This

GPU compute is expensive, and most AI products cover that cost with $20-50/month subscriptions you have to commit to upfront. That is a hard ask when you have not yet confirmed the tool fits your workflow.

By owning our GPU infrastructure and running open-source models directly, we avoid per-API-call fees to third parties. Qwen 2.5, FLUX, Kokoro and Whisper cover most use cases well. Because we only pay for the hardware (not per inference), a genuinely free tier is financially viable for us.

Premium models such as GPT-4, Claude and Gemini carry real API costs, which we pass through without markup. Those are billed per token with no subscription attached. You pay only when you use them.

How It Works

Free tokens: every account starts with 10,000 tokens; 5,000 more are added each day. No card needed.
Self-hosted models: Qwen 2.5, FLUX, Kokoro, Whisper and others run on our own GPU servers at the lowest token cost.
400+ external models: GPT-4, Claude, Gemini and hundreds of others from major providers, each billed per token at cost.
One token system: a single balance covers every tool; the token cost for each request is displayed before you submit.
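
The token arithmetic above can be sketched in a few lines of Python. This is an illustration only, not the production billing code: the `TokenAccount` class, its method names and the 1,200-token request cost are hypothetical, and the sketch assumes daily grants accumulate with no cap or expiry, which the text does not state. Only the 10,000 starting tokens and the 5,000 daily tokens come from the description above.

```python
from dataclasses import dataclass

SIGNUP_GRANT = 10_000  # starting balance, from the text
DAILY_GRANT = 5_000    # added each day, from the text


@dataclass
class TokenAccount:
    """Hypothetical model of the single balance shared by every tool."""

    balance: int = SIGNUP_GRANT

    def add_daily_grant(self, days: int = 1) -> None:
        # Assumption: grants simply accumulate, with no cap or expiry.
        if days < 0:
            raise ValueError("days must be non-negative")
        self.balance += DAILY_GRANT * days

    def can_afford(self, cost: int) -> bool:
        # The cost is known before submission, so it can be checked up front.
        return 0 <= cost <= self.balance

    def charge(self, cost: int) -> int:
        # Deduct the displayed cost and return the remaining balance.
        if not self.can_afford(cost):
            raise ValueError("insufficient tokens")
        self.balance -= cost
        return self.balance


account = TokenAccount()
account.add_daily_grant(days=3)  # 10,000 + 3 * 5,000
print(account.balance)           # 25000
account.charge(1_200)            # hypothetical request cost
print(account.balance)           # 23800
```

Under these assumptions, an account that makes no requests for three days holds 25,000 tokens, and a 1,200-token request leaves 23,800.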

How We Operate

Open Source Models

Every model we host carries an open license (MIT, Apache 2.0 or OpenRAIL). Outputs belong to you for personal or commercial use, subject to the terms of each model's license.

Clear Pricing

Token cost is shown before each request. Self-hosted models carry no markup. External models are passed through at provider cost with no surcharge.

Privacy

We do not sell your data and do not use your inputs to train models. Requests processed by self-hosted models never leave our servers.

Honest Metrics

Numbers on the site reflect actual usage. If a feature is not ready, we say so rather than list it as available.

Tech Stack

The frontend is Next.js; the backend runs on Django 6.x. Model inference is handled by a FastAPI service on NVIDIA A100 GPUs hosted at Vultr. PostgreSQL manages persistent data, Redis handles caching and Nginx routes traffic. Self-hosted models run under their respective open licenses (Apache 2.0, MIT, OpenRAIL).

Looking for the old Rewind?

This site is not affiliated with the original Rewind AI (the Mac app and Pendant, later Limitless). That product shut down in December 2025 after Meta acquired Limitless. Read the full story on what happened to Rewind AI.

Contact

Questions or issues? Email us at [email protected] or use the contact page.
