forked from ailab/sv3d
66 lines
3.1 KiB
Markdown
66 lines
3.1 KiB
Markdown
---
|
||
license: other
|
||
license_name: sv3d-nc-community
|
||
license_link: LICENSE
|
||
datasets:
|
||
- allenai/objaverse
|
||
pipeline_tag: image-to-video
|
||
extra_gated_prompt: >-
|
||
By clicking "Agree", you agree to the [License Agreement](https://huggingface.co/stabilityai/sv3d/blob/main/LICENSE) and acknowledge Stability AI's [Privacy Policy](https://stability.ai/privacy-policy).
|
||
extra_gated_fields:
|
||
Name: text
|
||
Email: text
|
||
Country: country
|
||
Organization or Affiliation: text
|
||
Receive email updates and promotions on Stability AI products, services, and research?:
|
||
type: select
|
||
options:
|
||
- Yes
|
||
- No
|
||
---
|
||
# Stable Video 3D
|
||
![](sv3doutputs.gif)
|
||
**Stable Video 3D (SV3D)** is a generative model based on [Stable Video Diffusion](https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt) that takes in a still image of an object as a conditioning frame, and generates an orbital video of that object.
|
||
|
||
Please note: For commercial use, please refer to https://stability.ai/membership.
|
||
|
||
## Model Details
|
||
|
||
This model was trained to generate 21 frames at resolution 576x576 given a context frame of the same size, finetuned from SVD Image-to-Video. Please check our [tech report](https://stability.ai/s/SV3D_report.pdf) and [video summary](https://youtu.be/Zqw4-1LcfWg) for details.
|
||
|
||
We release two variants of the model:
|
||
1. **SV3D_u**: This variant generates orbital videos based on single image inputs without camera conditioning.
|
||
2. **SV3D_p**: Extending the capability of SVD3_u, this variant accommodates both single images and orbital views allowing for the creation of 3D video along specified camera paths.
|
||
|
||
|
||
### Model Description
|
||
|
||
* **Developed by**: [Stability AI](https://stability.ai/)
|
||
* **Model type**: Generative image-to-video model
|
||
* **License**: [StabilityAI Non-Commercial Research Community License](https://huggingface.co/stabilityai/sv3d/raw/main/LICENSE).
|
||
* **Commercial License**: to use this model commercially, please refer to https://stability.ai/membership
|
||
|
||
|
||
### Model Sources
|
||
|
||
* **Repository**: https://github.com/Stability-AI/generative-models
|
||
* **Tech report**: https://stability.ai/s/SV3D_report.pdf
|
||
* **Video summary**: https://youtu.be/Zqw4-1LcfWg
|
||
* **Project page**: https://sv3d.github.io
|
||
* **arXiv page**: https://arxiv.org/abs/2403.12008
|
||
|
||
### Training Dataset
|
||
|
||
We use renders from the [Objaverse](https://objaverse.allenai.org/objaverse-1.0) dataset, utilizing our enhanced rendering method that more closely replicate the distribution of images found in the real world, significantly improving our model’s ability to generalize. We selected a carefully curated subset of the Objaverse dataset for the training data, which is available under the CC-BY license.
|
||
|
||
|
||
## Usage
|
||
|
||
For usage instructions, please refer to our [generative models GitHub repository](https://github.com/Stability-AI/generative-models)
|
||
|
||
|
||
### Out-of-Scope Use
|
||
|
||
The model was not trained to be factual or true representations of people or events,
|
||
and therefore using the model to generate such content is out-of-scope for the abilities of this model.
|
||
The model should not be used in any way that violates Stability AI's [Acceptable Use Policy](https://stability.ai/use-policy). |