[360Labs.ai]
0%
Back to Portfolio
DeepForge
Research and Development|Live

DeepForge

End-to-end synthetic video generation platform. Face-swap (InsightFace), voice cloning (Coqui XTTS-v2), lip-sync (Wav2Lip), and face restoration (GFPGAN) unified into a single production pipeline.

Overview

DeepForge orchestrates open-source models into a unified 5-pipeline video generation engine. Face-swap, voice cloning, lip-sync, full video generation, and detection adversary scoring.

Specs

Face-SwapInsightFace + inswapper_128 + GFPGAN (70M params)
VoiceCoqui XTTS-v2 (467M params) from 30s reference
Lip-SyncWav2Lip (12M params)
Output512x512, 30 FPS, 33ms/frame

Features

  • RetinaFace detection, InsightFace 512-dim embedding, inswapper_128 face replacement
  • GFPGAN face restoration with Poisson seamless blending
  • Coqui XTTS-v2 multilingual voice cloning with HiFi-GAN vocoder
  • Wav2Lip synchronizing generated speech with facial keypoints
  • End-to-end orchestration with frame-level A/V sync, H.264 encoding

Tech Stack

InsightFaceGFPGANCoqui XTTSWav2LipHiFi-GANPyTorchCUDA

Interested in DeepForge?

Let's discuss how this can work for you.

Get in Touch