How do you use this tool?
- Drag a video onto the drop zone or pick one via the file dialog (MP4, MOV, WebM, up to FullHD 1920×1080, up to 500 MB).
- Pick a model: Quality (best hair edges, requires a modern GPU) or Speed (faster, works on weaker hardware and longer videos).
- Pick an output mode: Transparent (WebM with alpha channel) or Solid color (MP4 with the chosen background color).
- Wait for the one-time model download — afterwards it's cached in your browser, available offline.
- Preview the result and download the finished video.
Why do other services have to upload your video?
Cloud-based video background removers have to send your video to a server, process it there, and hand you a download URL. That’s not a design flaw — it’s architecturally unavoidable: AI inference on a GPU cluster costs money, refinanced through subscriptions, watermarks, credits, or ads.
The problem for personal recordings: you have no control over what happens to the video on the server, how long it stays, or whether it ends up in training data. Cloud services also get shut down regularly — when the provider stops operating, every uploaded file is gone.
Modern browsers can flip this equation. Through the WebGPU interface, the browser talks directly to the graphics card on your device and runs the AI model fully locally. The model loads once, gets cached, and stays available — even offline.
How does background removal work?
The tool combines three browser interfaces that have been available in all current desktop browsers since 2024:
Video decoding in the browser. Your video is decoded frame by frame using the browser’s built-in hardware decoder — the same decoder used for streaming, no external codec library required.
AI inference on the graphics card. A specialized neural network for image segmentation runs directly on the GPU. For every frame it computes an alpha mask: a greyscale image telling each pixel how strongly it belongs to the foreground. The input is scaled to a standardized size and the result is upscaled bilinearly back to the original resolution.
Frame compositing and encoding. The masked frame is combined with the chosen background and re-encoded. Transparent mode produces a WebM container with VP9 and an alpha channel — the format DaVinci Resolve, Premiere Pro, and After Effects can read directly. Solid color mode produces an MP4 with H.264 over the chosen background color — playable on any device.
What are common use cases?
Social media content. Short-form creators on YouTube, TikTok, and Instagram replace cluttered home backgrounds with a clean alpha cut-out to drop into a branded environment in their editor — no studio rental, no SaaS subscription.
Recorded presentations. Loom and Zoom recordings often have a distracting home-office background. Strip it out before sharing with clients.
Product demo videos. Software demo recordings often catch a messy desktop or cluttered toolbar. Remove the background and place the talent over a neutral or branded backdrop.
Online course material. Talking-head segments work better against on-topic imagery or slide content than against a wall — replace the background to reinforce the lesson visually.
Async work videos. Teams using async video (Loom, Vidyard, Notion video) can scrub their home environment from the frame before sharing with colleagues.
Which output options are available?
| Output mode | Format | Good for |
|---|---|---|
| Transparent | WebM + VP9 with alpha | Compositing in a video editor |
| Solid color | MP4 + H.264 | Quick share, ready to upload |
Image and video backgrounds, blur, and PNG-sequence export are planned for a later release — the current version covers the two output modes most users actually ship with.
What are the limits?
Honest expectations beat marketing promises:
Temporal consistency. The model processes each frame independently — it has no memory of the previous frame. On hair edges in motion this can show up as light flicker. The Quality mode minimizes it; for maximum stability apply edge smoothing in your editor.
Audio. The first version outputs the video track only. Workflow: keep the original open in your editor and lay its audio track over the cut-out — no lipsync drift, since frame rate and frame count stay identical.
Hardware. The Quality model needs a modern GPU with WebGPU support. On older Windows hardware or iOS Safari the tool automatically falls back to the Speed model — without crashing mid-inference.
Resolution. Maximum input FullHD (1920×1080). Higher resolutions are rejected because browser ML models sample 4K down to 1024 pixels internally, producing visible softening on hair edges. File size: soft cap 500 MB.
Frequently Asked Questions
Why does the segmentation look rough around hair and fine details?
Edge refinement on complex boundaries (hair, fur, fine fabric) is the hardest problem in video matting. The mask is upscaled with bilinear filtering to soften the boundary, but individual fine strands may not be perfectly isolated. For broadcast-quality matting a physical green screen plus chroma key still wins.
Does this replace green-screen software?
For controlled professional shoots, a physical green screen plus chroma-key software (DaVinci Resolve, Adobe Premiere) produces noticeably cleaner results. This tool is built for run-and-gun footage where a green screen is impractical — vlogs, quick social clips, and async video messages.
Can I process multiple videos in a batch?
Currently files are processed one at a time. Batch processing is on the roadmap.
Where exactly does processing happen?
Everything runs in your browser tab — on your CPU or, when available, on your GPU via WebGPU. There is no server upload, no external API, no cloud hop in the processing path. The only network connection is the one-time model download on first use; after that everything stays offline.
Which related video tools help next?
Other tools from the kittokit toolkit that fit the topic:
- HEVC to H.264 — convert iPhone videos (HEVC/MOV) to universally playable H.264 MP4, also without upload.
- Background Remover — remove the background from still images (JPG, PNG, HEIC), AI-based and upload-free.
- Webcam Background Blur — blur the background of your live camera feed in real time, for video calls without virtual rooms.
Last updated: