
<aside>

Overview

An AI faceless video generator SaaS lets users create professional-quality videos programmatically with AI, based on the description and theme they provide.

In this guide, we’ll focus on integrating Editly, a versatile open-source video editing library that lets you generate videos programmatically by combining different types of media clips (text, audio, video, images) and adding transitions and effects.

</aside>

<aside>

Key Features of the AI Faceless Video Generator

Before we get into the technical details, here are the key features your platform will offer:

1. Tech Stack

| Purpose | Tool/Framework Name | Website |
| --- | --- | --- |
| Web App | Next.js | nextjs.org |
| Auth & Database | Supabase | supabase.com |
| Programmatic Video Creation | Editly | GitHub repo |
| Advanced Programmatic Video Creation (Alternatives) | Diffusion Studio, Remotion | Diffusion Studio, Remotion |

2. AI APIs, Models To Use

| API/Model Name | Description | Website |
| --- | --- | --- |
| OpenAI API | Script and caption generation | openai.com |
| Flux | AI image generation | flux dev |
| LTX-Video (Optional) | Image-to-video generation | ltx-video |
| ElevenLabs | Text-to-speech generation for voiceovers | ElevenLabs.io |

3. Frontend and User Interface

  1. New Video Gen Section - For the MVP, build a simple form UI that lets users enter their video topic and select options for voiceover, visuals, and more.
  2. Overview Section - Show users the status of video creation (e.g., generating script, creating visuals, processing video) and the number of video generation credits left in the current billing cycle.
  3. History Section - A collection of the user's previously generated videos, each with a link to download or share it.
  4. Billing Section - Allow users to upgrade or downgrade their plan.
  5. Profile Section - Allow users to update their personal information, password, and email.


4. Backend and Video Generation

The backend for Video Creation with Editly involves several important steps:

  1. Processing requests from the frontend.
  2. Generating content (script, voiceover, images) using AI APIs.
  3. Video Composing with Editly
  4. Task Management for Video Processing (Optional)

Here’s a step-by-step breakdown of the backend process:

1. Processing requests from Frontend

The frontend sends a request (via an HTTP POST) to your backend with the necessary information (e.g., topic for the video, which could be used to generate a script). This data is received by the API layer.
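
As a minimal sketch of what the API layer might do first, the function below validates the JSON body of that POST request before any AI call is made. The field names (`topic`, `voice`, `visualStyle`) and the default values are assumptions for illustration, not a fixed schema.

```typescript
// Hypothetical request shape; the field names are assumptions for illustration.
interface VideoRequest {
  topic: string;
  voice: string;
  visualStyle: string;
}

type ParseResult =
  | { ok: true; value: VideoRequest }
  | { ok: false; error: string };

// Validate an untrusted JSON body sent by the form UI.
function parseVideoRequest(body: unknown): ParseResult {
  if (typeof body !== "object" || body === null) {
    return { ok: false, error: "Request body must be a JSON object" };
  }
  const record = body as Record<string, unknown>;
  const topic = typeof record.topic === "string" ? record.topic.trim() : "";
  if (topic.length === 0) {
    return { ok: false, error: "Field 'topic' is required" };
  }
  // Optional fields fall back to assumed defaults.
  const voice = typeof record.voice === "string" ? record.voice : "default";
  const visualStyle =
    typeof record.visualStyle === "string" ? record.visualStyle : "cinematic";
  return { ok: true, value: { topic, voice, visualStyle } };
}

const parsed = parseVideoRequest({ topic: "  History of the telescope " });
console.log(parsed);
```

Rejecting bad input at this point avoids spending API credits on requests that can never produce a video.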

2. Generating Content

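Once the backend receives the request, it triggers calls to different AI APIs to generate the required assets: the script and captions (OpenAI API), the images (Flux), optional image-to-video clips (LTX-Video), and the voiceover (ElevenLabs).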
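
One way to organize these calls is sketched below. The `AssetProviders` interface is hypothetical: each function stands in for a wrapper around one external API, and none of them is a real SDK call. The sketch only shows the ordering, with the script generated first and the images and voiceover generated in parallel afterwards.

```typescript
// Hypothetical provider interface: each function would wrap one external API.
// None of these names come from a real SDK.
interface AssetProviders {
  generateScript(topic: string): Promise<string[]>; // one caption per scene
  generateImage(sceneText: string): Promise<string>; // resolves to a file path
  generateVoiceover(fullScript: string): Promise<string>; // resolves to a file path
}

interface VideoAssets {
  scenes: { caption: string; imagePath: string }[];
  voiceoverPath: string;
}

async function generateAssets(
  topic: string,
  providers: AssetProviders
): Promise<VideoAssets> {
  // The script comes first because images and voiceover both depend on it.
  const captions = await providers.generateScript(topic);

  // One image request per scene, started together.
  const scenePromises = captions.map(async (caption) => ({
    caption,
    imagePath: await providers.generateImage(caption),
  }));

  // Images and voiceover do not depend on each other, so run them in parallel.
  const [scenes, voiceoverPath] = await Promise.all([
    Promise.all(scenePromises),
    providers.generateVoiceover(captions.join(" ")),
  ]);
  return { scenes, voiceoverPath };
}

// Stub providers so the sketch runs without network access or API keys.
const stubProviders: AssetProviders = {
  generateScript: async (topic) => [`Intro to ${topic}`, `More about ${topic}`],
  generateImage: async (sceneText) =>
    `/tmp/image-${encodeURIComponent(sceneText)}.png`,
  generateVoiceover: async () => "/tmp/voiceover.mp3",
};

generateAssets("telescopes", stubProviders).then((assets) => console.log(assets));
```

Passing the providers in as an argument keeps the orchestration testable and makes it easy to swap a model later.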

3. Video Composing with Editly

Once all the necessary assets (script, voiceover, images) are generated, the backend uses Editly to combine them into a video.
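
As a rough sketch of this step, the function below builds an edit spec in the general shape Editly accepts: output settings at the top level plus a list of clips, each made of layers. The `Scene` type, the 1080x1920 vertical resolution, the 30 fps setting, and the fade transition are assumptions for illustration. The Editly call itself is left as a comment so the example runs without the library installed.

```typescript
// Input assumed to come from the content-generation step; names are illustrative.
interface Scene {
  caption: string;
  imagePath: string;
  durationSeconds: number;
}

// Build a plain edit-spec object: one clip per scene, each with an image
// layer and a subtitle layer on top of it.
function buildEditSpec(scenes: Scene[], voiceoverPath: string, outPath: string) {
  return {
    outPath,
    width: 1080, // assumed vertical 9:16 output for short-form video
    height: 1920,
    fps: 30, // assumed frame rate
    audioFilePath: voiceoverPath, // single audio track for the whole video
    defaults: { transition: { name: "fade", duration: 0.5 } },
    clips: scenes.map((scene) => ({
      duration: scene.durationSeconds,
      layers: [
        { type: "image", path: scene.imagePath },
        { type: "subtitle", text: scene.caption },
      ],
    })),
  };
}

const exampleSpec = buildEditSpec(
  [
    { caption: "Galileo points a telescope at the sky.", imagePath: "scene1.png", durationSeconds: 5 },
    { caption: "Modern telescopes orbit the Earth.", imagePath: "scene2.png", durationSeconds: 5 },
  ],
  "voiceover.mp3",
  "output.mp4"
);
console.log(JSON.stringify(exampleSpec, null, 2));
// With Editly installed, the spec would then be rendered with: await editly(exampleSpec)
```

Building the spec as plain data means it can be logged, stored with the job, or inspected before any rendering time is spent.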

4. Task Management for Video Processing (Optional)

If video generation is resource-intensive and may take a long time (minutes to hours), consider using a task queue to handle it asynchronously. The API can then respond immediately with a job reference while a worker renders the video in the background.
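
The idea can be sketched with a small in-memory queue. This is for illustration only: the class, method names, and status values are hypothetical, and a real deployment would likely use a persistent queue so that jobs survive restarts.

```typescript
type JobStatus = "queued" | "processing" | "completed" | "failed";

interface VideoJob {
  id: number;
  topic: string;
  status: JobStatus;
}

// In-memory FIFO queue for illustration only.
class VideoJobQueue {
  private jobs: VideoJob[] = [];
  private nextId = 1;

  // Called by the API layer: returns immediately with a job record.
  enqueue(topic: string): VideoJob {
    const job: VideoJob = { id: this.nextId++, topic, status: "queued" };
    this.jobs.push(job);
    return job;
  }

  // Called by a worker: claims the oldest queued job, if any.
  claimNext(): VideoJob | undefined {
    const job = this.jobs.find((j) => j.status === "queued");
    if (job) job.status = "processing";
    return job;
  }

  // Called by the worker when rendering ends, successfully or not.
  finish(id: number, succeeded: boolean): void {
    const job = this.jobs.find((j) => j.id === id);
    if (job && job.status === "processing") {
      job.status = succeeded ? "completed" : "failed";
    }
  }

  // Polled by the frontend to show progress to the user.
  statusOf(id: number): JobStatus | undefined {
    return this.jobs.find((j) => j.id === id)?.status;
  }
}

const queue = new VideoJobQueue();
const submitted = queue.enqueue("telescopes");
const claimed = queue.claimNext();
if (claimed) queue.finish(claimed.id, true);
console.log(queue.statusOf(submitted.id)); // prints "completed"
```

The status values map naturally onto the progress messages shown to users while their video is being generated.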

5. Total Monthly Cost (Hosting + API Fees)

| Tool | Cost | Cost per Video (approx., for a 1-minute video) |
| --- | --- | --- |
| **OpenAI API (GPT-4o model)** | $0.01 / 1k tokens | $0.065 - $0.08 (2k input tokens; 5-6k tokens to generate structured JSON outputs) |
| Flux | $0.025 / image | $0.25 - $0.3 (10-12 images per video) |
| LTX-Video (Optional) | $0.026 / video (6-second clip) | $0.26 - $0.3 (10-12 video clips per video) |
| ElevenLabs | $0.11 / minute | $0.11 - $0.16 (cost may vary depending on your subscription plan) |
| Total Cost | | $0.6 - $0.8 per video |
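
To adapt these estimates to your own usage, the arithmetic can be sketched as follows. The unit prices and counts are copied from the table. Treating the OpenAI usage as 7-8k tokens (2k input plus 5-6k output) is our reading of the notes, and the ElevenLabs line uses only the $0.11 per minute rate, so plan-dependent pricing is not modeled.

```typescript
interface CostLine {
  name: string;
  unitPrice: number; // USD per unit
  minUnits: number;
  maxUnits: number;
  optional: boolean;
}

// Figures copied from the cost table; units are 1k tokens, images, clips, minutes.
const costLines: CostLine[] = [
  { name: "OpenAI API (GPT-4o)", unitPrice: 0.01, minUnits: 7, maxUnits: 8, optional: false },
  { name: "Flux", unitPrice: 0.025, minUnits: 10, maxUnits: 12, optional: false },
  { name: "LTX-Video", unitPrice: 0.026, minUnits: 10, maxUnits: 12, optional: true },
  { name: "ElevenLabs", unitPrice: 0.11, minUnits: 1, maxUnits: 1, optional: false },
];

// Sum the low and high ends of each line, skipping optional tools if requested.
function estimateCostPerVideo(
  lines: CostLine[],
  includeOptional: boolean
): { min: number; max: number } {
  let min = 0;
  let max = 0;
  for (const line of lines) {
    if (line.optional && !includeOptional) continue;
    min += line.unitPrice * line.minUnits;
    max += line.unitPrice * line.maxUnits;
  }
  return { min, max };
}

const withClips = estimateCostPerVideo(costLines, true);
const withoutClips = estimateCostPerVideo(costLines, false);
console.log(`With LTX-Video: $${withClips.min.toFixed(2)} - $${withClips.max.toFixed(2)}`);
console.log(`Without LTX-Video: $${withoutClips.min.toFixed(2)} - $${withoutClips.max.toFixed(2)}`);
```

With these inputs the estimate is about $0.69 to $0.80 per video when the optional LTX-Video clips are included, and about $0.43 to $0.49 without them.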

6. Is there room for a new player?

The broader market is crowded, but niches often remain underserved, so focus on a specific audience or use case.

Current Players:

Crayo.ai

Revid.ai

Autoshorts.ai

Invideo AI