【AI News 2025.04.23】画像/動画/3D関連

2025.04.23

画像関連

ByteDanceが2K対応の画像生成モデル『Seedream 3.0』を発表

Dreamina AI has officially launched Seedream 3.0 – its cutting-edge new model. The wait is over！
This is the latest breakthrough in AI image generation.

With Seedream 3.0, Dreamina AI delivers:
🎬 Cinematic-quality visuals
🖼️ 2K resolution output
💎 Ultra-realistic textures &… pic.twitter.com/T03EmHw80S
— Dreamina AI (@dreamina_ai) April 22, 2025

seed.bytedance.com

ByteDance Seed

https://seed.bytedance.com/en/tech/seedream3_0

ByteDanceが新たに発表した画像生成モデルSeedream 3.0は、現時点で画像生成モデルの中でも最も高いスコアを記録したそうです。ネイティブで2K（2048×2048）の出力ができ、1024×1024サイズの画像がわずか3秒で生成可能という驚異的な性能です。モデル自体の公開はされていませんが、同社の生成サービスDreaminaのみで利用が可能になっています。

複数画像から構図要素を抽出・再構成する画像生成モデルNVIDIA『IP-Composer』

GitHub

IP_Composer/README.md at master · saradorfman1/IP_Composer

https://github.com/saradorfman1/IP_Composer/blob/master/README.md

Contribute to saradorfman1/IP_Composer development by creating an account on GitHub.

NVIDIAが新たに公開したIP-Composerは、異なるいくつかのソースから一つのまとまりのある構成に融合させ、画像生成させるというもの。ワークフローにこのような需要も増えてきたので、ComfyUI上での再現も検討してみたいと思います。

動画関連

llyasviel氏が手がける革新的な軽量動画生成フレームワーク『FramePack』

GitHub

GitHub - lllyasviel/FramePack: Lets make video diffusion practical!

https://github.com/lllyasviel/FramePack

Lets make video diffusion practical! Contribute to lllyasviel/FramePack development by creating an account on GitHub.

lllyasviel.github.io

FramePack

https://lllyasviel.github.io/frame_pack_gitpage/

GitHub

GitHub - kijai/ComfyUI-FramePackWrapper

https://github.com/kijai/ComfyUI-FramePackWrapper

Contribute to kijai/ComfyUI-FramePackWrapper development by creating an account on GitHub.

ControlNetやIC-Lightを開発したllyasviel氏が、新たに動画生成フレームワークFramePackを発表しました。HunyuanVideoをベースに設計され、5秒の動画をVRAM 6GB環境で約8分で生成できるという軽量仕様が特徴。メモリ負荷を抑えつつ過去フレームに遡る設計によって映像生成を実現しています。さらに、Teacacheによる高速化やLoRA対応も進行中。Kijai氏によるComfyUI対応もすでに進んでおり、当分はFramePack関係の開発が話題になりそうです。

高精度な動画生成モデル『SkyReels-V2』が公開

GitHub

GitHub - SkyworkAI/SkyReels-V2: SkyReels-V2: Infinite-length Film Generative ...

https://github.com/SkyworkAI/SkyReels-V2

SkyReels-V2: Infinite-length Film Generative model - SkyworkAI/SkyReels-V2

OpenArt

SkyReels(V2) & Comfyui | ComfyUI Workflow | OpenArt

https://openart.ai/workflows/alswa80/skyreelsv2-comfyui/3bu3Uuysa5IdUolqVtLM

Created by: Abdallah Alswaiti: 1.Model: SkyReels-V2-I2V-1.3B-540P -Place In: ComfyUI/models/diffusion_models/ 2. Model File: clip_vision_h.safetensors Place In: ComfyUI/models/clip_vision/ 3. Text Encoder Models fp8 or fp16 Place In: ComfyUI/models/text_encoders/ 4. VAE Model wan_2.1_vae.safetensors Place In: ComfyUI/models/vae/

SkyworkAIが新たに公開したSkyReels-V2は、前モデルからさらに精度が向上した動画生成モデルです。WAN2.1の検証をしている間に公開されていました。WAN2.1のComfyUIワークフローを応用できそうなので、SkyReels-V2も同時に検証しようと思います。

アニメ表現だけじゃない！実写表現も進化『Vidu Q1』『Viduアプリ』

We know it’s been a minute… but trust us, something big is coming 👀 Stay tuned — the best is yet to come. #ViduQ1 #ViduAI pic.twitter.com/wtS2uzHJQS
— Vidu AI (@ViduAI_official) April 17, 2025

🚀 Vidu Q1 is here — unleash imagination with extreme Quality.
Four core capabilities, fully upgraded to elevate your creative journey:

🔸 Crisp Visuals — Sharper, more textured frames.
🔸 Cinematic Transitions — Fluid first-to-last frame movement
🔸 Precision Sound — Custom… pic.twitter.com/4unfSCGK8Q
— Vidu AI (@ViduAI_official) April 21, 2025

🚀 Vidu app is officially live!
We're excited to welcome everyone to explore a faster, smarter way to generate AI videos. #ViduQ1 #ViduAI pic.twitter.com/CvRx9gnQgG
— Vidu AI (@ViduAI_official) April 23, 2025

動画生成AI、Viduが最新モデルVidu Q1を発表し、さらに公式アプリもリリースされるなど、勢いを増しています。これまでアニメ調表現に強みを持っていたViduですが、今回のアップデートで実写系の表現力も大幅に向上。Klingに続く動きを見せており、今後の動画生成市場での存在感がさらに高まりそうです。アプリも登場したことで、より幅広いユーザーが気軽に高品質な映像制作を楽しめる環境が整ってきました。

Alibaba『Wan2.1-FLF2V-14B』は開始・終了フレーム指定型の動画生成モデル

2/3 🔧Powered by data-driven training and DiT architecture with First-Last Frame conditional control:
‒ Perfectly replicates reference visuals
‒ Precise instruction-following
‒ Smooth transitions + real-world physics adherence
‒ Cinema-quality 720P output pic.twitter.com/PDkUmfyOi3
— Wan (@Alibaba_Wan) April 17, 2025

🔹 Wan2.1 FLF2V 720P fp16 Workflow: https://t.co/oO3RWtmabI pic.twitter.com/Vy6gpfajHe
— ComfyUI (@ComfyUI) April 19, 2025

Alibabaが、スタートとエンドフレームを指定できる新たな動画生成モデルWan2.1-FLF2V-14Bを発表しました。発表直後にComfyUIがネイティブ対応し、ローカル環境でもスムーズに検証できるのが大きな魅力です。ローカル生成でここまでできることに感動しますね。ComfyUIワークフローは非常にわかりやすく、ComfyUIのワークフローデザイナーが加わったのでは？と予想します。

Hailuo AIにも『Character Reference』機能が実装される

Introducing Hailuo Image’s New Feature: Character Reference!

Transform a single image into dynamic, expressive characters with:
– Versatile angles, poses, and expressions
– Cinematic lighting and composition
– Full prompt control

📷 Let your characters move, emote, and… pic.twitter.com/WXKSdhkqZ0
— Hailuo AI (MiniMax) (@Hailuo_AI) April 23, 2025

Hailuo AIに、ついにCharacter Reference機能が実装されました。これはKling AIやViduにも実装されている機能で、キャラクターの一貫性を保ちながら映像を生成できる仕組みです。デザインや演出の自由度が大きく向上し、より魅力的なコンテンツ制作が可能になりました。

『Flex.2-preview』がControlNetやInpaintに対応

huggingface.co

ostris/Flex.2-preview · Hugging Face

https://huggingface.co/ostris/Flex.2-preview

ostrisで公開された画像生成モデルFlex.2-previewは、Apache 2.0ライセンスで利用可能です。ControlNetやInpaintにも対応しており、自由度の高い画像編集・生成が可能に。サービス展開を考えているユーザーにとっては、ライセンス面でも安心して使える有力な選択肢となりそうです。

ベンチマークを圧倒！オープンソースで登場した動画生成モデル『MAGI-1』

sand.ai

Magi - Revolutionary AI Video Generation

https://sand.ai/magi

The first autoregressive video model with top-tier quality output.Magi is a powerful AI video generator that transforms your ideas into stunning videos for free. Extend videos effortlessly with cutting-edge Generative AI tech!

GitHub

GitHub - SandAI-org/MAGI-1: MAGI-1: Autoregressive Video Generation at Scale

https://github.com/SandAI-org/Magi-1

MAGI-1: Autoregressive Video Generation at Scale. Contribute to SandAI-org/MAGI-1 development by creating an account on GitHub.

🚰 Magi thoroughly understands the laws of physics.

🏆 Magi leads the Physics-IQ Benchmark with exceptional physics understanding

⏳ 4/5 pic.twitter.com/1V75xxFLw3
— Sand.ai (@SandAI_HQ) April 21, 2025

SandAIが発表した新たな動画生成モデルMAGI-1が、オープンソースとして公開されました。ベンチマークテストでは非常に高い精度を示しており、今後の動画生成技術の新たな基準になりそうな勢いです。動向をしっかり追いながら、活用シーンも広げてみたいですね。

Live2Dを活用してAIチューバー化する『Persona Engine』

GitHub

GitHub - fagenorn/handcrafted-persona-engine: An AI-powered interactive avata...

https://github.com/fagenorn/handcrafted-persona-engine?tab=readme-ov-file

An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applications. - fagenorn/handcrafted-persona-engine

Live2Dモデルを活用してAIチューバーを作成できるPersona Engineが公開されました。インタラクティブなアバターを作成するためのオールインワンツールキットが魅力で、個人でも手軽にAIキャラクターを展開できるというもの。用途次第で配信やコンテンツ制作、インタラクティブな体験コンテンツなど幅広い活用が期待できそうです。

Luma AI『Camera Angle Concepts』がカメラワークの幅をさらに拡大

Introducing Camera Angle Concepts for #Ray2 — a new way to control your POV. Choose your angle and consistently frame your story with cinematic perspectives like overhead, selfie, low angle, over the shoulder, aerial, and more. Available now in #DreamMachine. pic.twitter.com/jOJupiq3Kd
— Luma AI (@LumaLabsAI) April 18, 2025

Luma AIが新たに発表したCamera Angle Conceptsでは、カメラワークのバリエーションがさらに増えました。これまで以上に多彩なアングルや演出が可能になり、細かなカメラ操作にこだわりたいユーザーにとっては、非常に嬉しいアップデートですね。

カメラも人物も自在に制御！Alibabaの新フレームワーク『Unifying Precisely 3D』発表

ewrfcas.github.io

Uni3C

https://ewrfcas.github.io/Uni3C/

Alibabaが発表したUnifying Precisely 3D（Uni3C）は、カメラワークと人物モーションの両方を自由にコントロールしながら動画生成ができるフレームワークです。近日中に公開予定とのことで、特にWan2.1などのモデルと組み合わせた活用ができればと期待しています。正式リリースが待ち遠しいですね。

3D関連

メッシュ精度が大幅向上した『Hunyuan 3D AI Engine 2.5』

Hunyuan 3D AI Engine Upgrade Releasing~ Join us and ask our experts any questions. https://t.co/HLehhfsNRM
— Hunyuan (@TencentHunyuan) April 23, 2025

TencentのHunyuanチームが開発するHunyuan 3D AI Engineがバージョン2.5にアップグレードされ、メッシュやテクスチャの精度が向上しました。今回のアップデートでは、マテリアル設定やリライティングなど、さまざまな機能を搭載したWebサービスとして強化されているもようです。中でも、メッシュの滑らかさが大きく改善された点が注目です。今後はオープンソースのアップグレードが行われなくなりそうな予感です。

ついに出た！3Dモデルへの自動リギングがオープンソースで可能な『UniRig』

zjp-shadow.github.io

Unirig: Efficient 3D Character Generation from Single Images

https://zjp-shadow.github.io/works/UniRig/

Unirig

GitHub

GitHub - VAST-AI-Research/UniRig: One Model to Rig Them All: Diverse Skeleton...

https://github.com/VAST-AI-Research/UniRig

One Model to Rig Them All: Diverse Skeleton Rigging with UniRig - VAST-AI-Research/UniRig

VAST AI Researchが、3Dモデルへの自動リギングを行うAIモデルUniRigをオープンソースで公開しました。画像→モデル生成→UV展開→テクスチャー→リギングまでをComfyUIワークフローに組み込めば、制作プロセスがさらにスムーズになりそうですね。

新しいトレーニングプロファイルが追加された『Postshot v0.6』

Postshot v0.6 is now available🚀New Training Profile for Improved Detail, Region of Interest Training, Orthographic Cameras, Lens Shift, Selection Filter, AE Scene Navigation Helper. Read more: https://t.co/Qz1A1gHZJA Download now https://t.co/7JixQ01gHv pic.twitter.com/MWkPiglQWK
— Jawset (@jawset) April 22, 2025

Postshotがv0.6にアップデートされたとのことで、当初、RTX 5090環境では動作しなかったため、検証が進められずにいましたが、今回の対応により本格的なパフォーマンステストが可能に！これから検証を始めるのが楽しみなアップデートです。

▼この記事の監修

takio koizumi
デジタルアーティスト。デジタルハリウッド大学で3DCGを学ぶ。大学院修了後、VFXアーティストとして約10年間、映画・アニメ・ゲームなど多彩なジャンルの作品を手がける。近年はAIに精通し、生成AI技術を取り入れたワークフローを研究し発信している。
HP： https://sites.google.com/view/takio-koizumi/link

NEWS

【AI News 2025.04.23】画像/動画/3D関連

画像関連

ByteDanceが2K対応の画像生成モデル『Seedream 3.0』を発表

複数画像から構図要素を抽出・再構成する画像生成モデルNVIDIA『IP-Composer』

動画関連

llyasviel氏が手がける革新的な軽量動画生成フレームワーク『FramePack』

高精度な動画生成モデル『SkyReels-V2』が公開

アニメ表現だけじゃない！実写表現も進化『Vidu Q1』『Viduアプリ』

Alibaba『Wan2.1-FLF2V-14B』は開始・終了フレーム指定型の動画生成モデル

Hailuo AIにも『Character Reference』機能が実装される

『Flex.2-preview』がControlNetやInpaintに対応

ベンチマークを圧倒！オープンソースで登場した動画生成モデル『MAGI-1』

Live2Dを活用してAIチューバー化する『Persona Engine』

Luma AI『Camera Angle Concepts』がカメラワークの幅をさらに拡大

カメラも人物も自在に制御！Alibabaの新フレームワーク『Unifying Precisely 3D』発表

3D関連

メッシュ精度が大幅向上した『Hunyuan 3D AI Engine 2.5』

ついに出た！3Dモデルへの自動リギングがオープンソースで可能な『UniRig』

新しいトレーニングプロファイルが追加された『Postshot v0.6』

関連記事一覧

【AI News 2025.04.02】画像/音楽/動画関連

【AI News 2025.05.07】画像/動画/3D/音楽

【AI News 2025.04.23】LLM/ComfyUI/AIサービス

【AI News 2025.04.09】LLM/ComfyUI/AIサービス関連

【AI News 2025.04.30】3D/音楽/音声

【AI News 2025.04.23】おすすめ記事5選

【AI News 2025.04.30】画像/動画

【AI News 2025.03.12】動画/3D