Neets.ai supports several generative models for text and audio:
| Model Name | Type | Description | Cost |
|---|---|---|---|
| style-diff-500 | TTS | Speech synthesis model with style diffusion and adversarial training | $0.005/1k characters |
| vits | TTS | VITS (https://arxiv.org/abs/2106.06103) is a popular end-to-end (one-stage) TTS model. Our hosted VITS TTS API features ultra-fast inference, the largest offering of languages (88) and the lowest price across all vendors. | $0.001/1k characters |
| ar-diff-50k | TTS | Tortoise-style AR+diffusion model | $0.03/1k characters |
| Neets-7B | LLM | Mistral-7B fork | $0.55/million LLM tokens |
| mistralai/Mixtral-8X7B-Instruct-v0.1 | LLM | Pretrained generative Sparse Mixture of Experts | $0.55/million LLM tokens |
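
To get a rough feel for what the prices in the table mean for a given workload, the arithmetic can be sketched in a few lines of Python. This is only an illustration: the helper names `estimate_tts_cost` and `estimate_llm_cost` are hypothetical and not part of any Neets.ai SDK, and the sketch assumes usage is billed strictly pro rata, with no minimum charge or rounding. How tokens are counted for LLM requests is not stated here, so the caller supplies the total.

```python
# Prices copied from the table above (USD). The dictionaries and helper
# functions are hypothetical names used for illustration only.
TTS_PRICE_PER_1K_CHARS = {
    "style-diff-500": 0.005,
    "vits": 0.001,
    "ar-diff-50k": 0.03,
}

LLM_PRICE_PER_1M_TOKENS = {
    "Neets-7B": 0.55,
    "mistralai/Mixtral-8X7B-Instruct-v0.1": 0.55,
}


def estimate_tts_cost(model: str, num_characters: int) -> float:
    """Estimated USD cost of synthesizing num_characters with a TTS model.

    Assumes linear (pro rata) billing per 1,000 characters.
    """
    return TTS_PRICE_PER_1K_CHARS[model] * num_characters / 1_000


def estimate_llm_cost(model: str, num_tokens: int) -> float:
    """Estimated USD cost of num_tokens billed LLM tokens.

    Assumes linear (pro rata) billing per 1,000,000 tokens.
    """
    return LLM_PRICE_PER_1M_TOKENS[model] * num_tokens / 1_000_000


if __name__ == "__main__":
    # Example workloads; the sizes are arbitrary.
    print(f"vits, 10,000 characters: ${estimate_tts_cost('vits', 10_000):.4f}")
    print(f"Neets-7B, 2,000,000 tokens: ${estimate_llm_cost('Neets-7B', 2_000_000):.2f}")
```

Under these assumptions, 10,000 characters of `vits` speech comes to about $0.01, while the same text through `ar-diff-50k` would be about $0.30, thirty times the per-character rate.
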
We are always improving our services and working on new models. Follow us on X or join our Discord to be the first to know when we update or add new models.