66 lines
3.1 KiB
Markdown
66 lines
3.1 KiB
Markdown
|
---
|
|||
|
license: other
|
|||
|
license_name: sv3d-nc-community
|
|||
|
license_link: LICENSE
|
|||
|
datasets:
|
|||
|
- allenai/objaverse
|
|||
|
pipeline_tag: image-to-video
|
|||
|
extra_gated_prompt: >-
|
|||
|
By clicking "Agree", you agree to the [License Agreement](https://huggingface.co/stabilityai/sv3d/blob/main/LICENSE) and acknowledge Stability AI's [Privacy Policy](https://stability.ai/privacy-policy).
|
|||
|
extra_gated_fields:
|
|||
|
Name: text
|
|||
|
Email: text
|
|||
|
Country: country
|
|||
|
Organization or Affiliation: text
|
|||
|
Receive email updates and promotions on Stability AI products, services, and research?:
|
|||
|
type: select
|
|||
|
options:
|
|||
|
- Yes
|
|||
|
- No
|
|||
|
---
|
|||
|
# Stable Video 3D
|
|||
|
![](sv3doutputs.gif)
|
|||
|
**Stable Video 3D (SV3D)** is a generative model based on [Stable Video Diffusion](https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt) that takes in a still image of an object as a conditioning frame, and generates an orbital video of that object.
|
|||
|
|
|||
|
Please note: For commercial use, please refer to https://stability.ai/membership.
|
|||
|
|
|||
|
## Model Details
|
|||
|
|
|||
|
This model was trained to generate 21 frames at resolution 576x576 given a context frame of the same size, finetuned from SVD Image-to-Video. Please check our [tech report](https://stability.ai/s/SV3D_report.pdf) and [video summary](https://youtu.be/Zqw4-1LcfWg) for details.
|
|||
|
|
|||
|
We release two variants of the model:
|
|||
|
1. **SV3D_u**: This variant generates orbital videos based on single image inputs without camera conditioning.
|
|||
|
2. **SV3D_p**: Extending the capability of SVD3_u, this variant accommodates both single images and orbital views allowing for the creation of 3D video along specified camera paths.
|
|||
|
|
|||
|
|
|||
|
### Model Description
|
|||
|
|
|||
|
* **Developed by**: [Stability AI](https://stability.ai/)
|
|||
|
* **Model type**: Generative image-to-video model
|
|||
|
* **License**: [StabilityAI Non-Commercial Research Community License](https://huggingface.co/stabilityai/sv3d/raw/main/LICENSE).
|
|||
|
* **Commercial License**: to use this model commercially, please refer to https://stability.ai/membership
|
|||
|
|
|||
|
|
|||
|
### Model Sources
|
|||
|
|
|||
|
* **Repository**: https://github.com/Stability-AI/generative-models
|
|||
|
* **Tech report**: https://stability.ai/s/SV3D_report.pdf
|
|||
|
* **Video summary**: https://youtu.be/Zqw4-1LcfWg
|
|||
|
* **Project page**: https://sv3d.github.io
|
|||
|
* **arXiv page**: https://arxiv.org/abs/2403.12008
|
|||
|
|
|||
|
### Training Dataset
|
|||
|
|
|||
|
We use renders from the [Objaverse](https://objaverse.allenai.org/objaverse-1.0) dataset, utilizing our enhanced rendering method that more closely replicate the distribution of images found in the real world, significantly improving our model’s ability to generalize. We selected a carefully curated subset of the Objaverse dataset for the training data, which is available under the CC-BY license.
|
|||
|
|
|||
|
|
|||
|
## Usage
|
|||
|
|
|||
|
For usage instructions, please refer to our [generative models GitHub repository](https://github.com/Stability-AI/generative-models)
|
|||
|
|
|||
|
|
|||
|
### Out-of-Scope Use
|
|||
|
|
|||
|
The model was not trained to be factual or true representations of people or events,
|
|||
|
and therefore using the model to generate such content is out-of-scope for the abilities of this model.
|
|||
|
The model should not be used in any way that violates Stability AI's [Acceptable Use Policy](https://stability.ai/use-policy).
|