Models

Neets.ai supports multiple different generative models for text and audio:

Model Name	Type	Description	Cost
style-diff-500	TTS	Speech synthesis model with style diffusion and adversarial training	$0.005/1k characters
vits	TTS	VITS (https://arxiv.org/abs/2106.06103) is a popular end-to-end (one-stage) TTS model. Our hosted VITS TTS API features ultra-fast inference, the largest offering of languages (88) and the lowest price across all vendors.	$0.001/1k characters
ar-diff-50k	TTS	Tortoise-style AR+diffusion model	$0.03/1k characters
Neets-7B	LLM	Mistral-7B fork	$0.55/million LLM tokens
mistralai/Mixtral-8X7B-Instruct-v0.1	LLM	Pretrained generative Sparse Mixture of Experts	$0.55/million LLM tokens

We are always improving our services and working on new models. Follow us on X or join our Discord to be the first to know when we update or add new models.