[AI News 2025.04.16] Image / Audio / Video / 3D

2025.04.16

Image

Freepik is lightning fast! Now supporting both 'Kling 2.0' and 'Composition Reference'

This is big 🔥 Kling 2.0 is here, and we’re @Kling_ai's official launch partners

Smoother motion, stronger prompt response, film-level results

Text-to-video or image-to-video, up to 10s

No need to look elsewhere, we're the first AI suite to include it pic.twitter.com/gvpFJwPChD
— Freepik (@freepik) April 15, 2025

Introducing Composition Reference. Build any visual from a reference image or a sketch with notes

Add notes and generate images with the same structure and elements as your draft

Tips and tricks below 🧵👇 pic.twitter.com/qKueGZC6B3
— Freepik (@freepik) April 14, 2025

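Freepik, the stock-asset platform, has announced support for the video generation model Kling 2.0 with surprising speed, alongside its own new feature, Composition Reference. Kling 2.0 can now be used directly on Freepik, so if you have Freepik credits, go ahead and test it and put it to work.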

They've done it again! ByteDance releases 'UNO', a universal image generation framework

Less-to-More Generalization: Unlocking More Controllability by In-Context Gen...

https://bytedance.github.io/UNO/

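UNO is a universal framework that consolidates image generation tasks previously handled separately by tools such as ControlNet, Omini, and IP-Adapter into a single model. Completing everything in one model means fewer steps and a more efficient workflow, which makes this a very welcome release.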

'Pika' Twists feature finally opens to all users!

Your videos, with a shocking twist. Introducing Pika Twists: manipulate any character or object in your footage, while keeping the rest perfectly intact.

Available now at Pika dot art or on the iOS app. pic.twitter.com/xBnv9deEG4
— Pika (@pika_labs) April 10, 2025

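Pika has added a new feature, Twists, and made it available to all users. With Twists, you can manipulate characters or objects in a video while leaving everything else intact. It's an addition that greatly expands creative freedom in video.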

Audio

Free AI speech synthesis software 'AivisSpeech' updated! ver 1.1.0-preview.3 released

[📢 Announcement]
AivisSpeech 1.1.0-preview.3 has been available since 04/01!
It includes many improvements and bug fixes, so please update 🙏

⬇️ Download here: https://t.co/Qy1pWFKgT8

🙇 1.1.0… https://t.co/ef4y4YGVzP pic.twitter.com/jbiE1crRLt
— Aivis Project (@aivis_project) April 15, 2025

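AivisSpeech, a free AI speech synthesis tool, has been updated to version 1.1.0-preview.3. Its appeal is that it produces natural-sounding speech despite being free. I had just started testing it recently for tasks such as having it read the AI-System manual aloud, so the update is welcome. It is a very reassuring option for anyone looking for audio AI tools.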

Video

'KLING 2.0' officially announced! Kling AI's evolution shows no sign of stopping

⚡️ Massive Update Just Dropped: Phase 2.0 for Kling AI!
🎥 KLING 2.0 Master for video generation, 🖼️ KOLORS 2.0 for image generation, 🎮 Multi-Elements Editor, 🎨 Image Editing & Restyle…
Kling AI 2.0 is all about empowering creators to bring meaningful stories to life — with… pic.twitter.com/lPXliuZB7q
— Kling AI (@Kling_ai) April 15, 2025

So cinematic & impactful! We've partnered with @visualsk2 (on Instagram) on this action film "The Surge" using the latest KLING 2.0 model!
🎤 Let's hear from VISUALSK2:
"In The Surge, a shattered world sets the stage for the journey of a lone protagonist who, upon discovering a… pic.twitter.com/xV1bbLJYOj
— Kling AI (@Kling_ai) April 15, 2025

We've partnered with the maestro @jacopo_reale to create this delicate short film "Love at First Sight" using the latest KLING 2.0 model!
🎤 Let's hear from Jacopo:
"Love at First Sight is a film that explores the deep connection between art and the power of imagination. It is a… pic.twitter.com/1kL5dXmWeI
— Kling AI (@Kling_ai) April 15, 2025

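At its global launch event, Kling AI officially announced its long-awaited latest model, KLING 2.0. Kling has already drawn attention for high-fidelity video generation, and this update further improves motion smoothness and compositional consistency, taking its expressiveness up a notch. Kling really is strong.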

Google 'Whisk' can now generate video with Veo 2

Whisk Animate is now available globally to Google One AI premium subscribers!

With Veo 2, turn your Whisk-generated images into vivid eight-second animated clips – perfect for reimagining your images in different styles or bringing new ideas to life. 🎥 pic.twitter.com/j7GZ3rZ7sh
— Google Labs (@GoogleLabs) April 15, 2025

Google has announced Whisk Animate. Using Veo 2, it turns images generated in Whisk into eight-second animated clips. It is available to Google One AI Premium subscribers.

Google's video generation model 'Veo 2' rolls out to Gemini Advanced users

Today we’re rolling out Veo 2, our state-of-the-art video generation model, to Gemini Advanced users. Create high-quality 8s videos from text in any style: https://t.co/bk1bIk4bTs

Make a name for yourself with Veo 2 (just like we did).

Prompt for this one: “Turn the word… pic.twitter.com/YWNsuMsJHg
— Google Gemini App (@GeminiApp) April 15, 2025

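Google's latest video generation model, Veo 2, is now rolling out to Gemini Advanced users. Veo 2 stands out for its high-fidelity visuals and natural motion, and generates 8-second videos from text. Gemini Advanced is one of the AI offerings included in Google's premium plan, so this update is a must-see for anyone who wants to try cutting-edge video generation early.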

The result of 665,000 hours of training!? ByteDance's video generation model 'Seaweed-7B' is incredible

Seaweed

https://seaweed.video

Seaweed is a video generation foundational model by ByteDance Seed.

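ByteDance's newly announced video generation model, Seaweed-7B, was trained from scratch using a whopping 665,000 NVIDIA H100 GPU hours. Its capabilities are striking: one-minute video generation, upscaling to 2K, and real-time generation at 720p and 24 fps. Details of a public release have not been disclosed at this point, but its capabilities and future direction are drawing a lot of attention.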
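
To put the 665,000 H100 GPU-hour figure in perspective, here is a minimal sketch that converts a GPU-hour budget into wall-clock training time. The cluster sizes are hypothetical values chosen for illustration, not numbers from the announcement, and the function `wall_clock_days` assumes perfect parallel scaling with no downtime, which real training runs do not achieve.

```python
def wall_clock_days(gpu_hours: float, num_gpus: int) -> float:
    """Convert a total GPU-hour budget into wall-clock days on num_gpus GPUs.

    Assumes ideal linear scaling and zero downtime (an idealization).
    """
    if num_gpus <= 0:
        raise ValueError("num_gpus must be positive")
    return gpu_hours / num_gpus / 24


# Figure quoted in the article for Seaweed-7B.
SEAWEED_GPU_HOURS = 665_000

# Hypothetical cluster sizes, for illustration only.
for n in (256, 1_000, 4_096):
    days = wall_clock_days(SEAWEED_GPU_HOURS, n)
    print(f"{n:>5} GPUs -> {days:.1f} days")
```

On a hypothetical 1,000-GPU cluster this works out to roughly four weeks of continuous training.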

'Higgsfield Mix' boosts expressiveness by mixing camera motions

Higgsfield Mix breaks the limits of camera motion.

You can combine two motion controls in a single shot for more complex pacing, sharper turns, or movements that were never possible on set.

Here’s what it looks like in action: pic.twitter.com/5atJpN7W5O
— Higgsfield AI 🧩 (@higgsfield_ai) April 14, 2025

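Higgsfield now supports mixing camera motions. By combining two motion controls in a single shot, you can achieve more dynamic and polished camera work. Beyond camera work, the high quality of Higgsfield's visual effects is also part of its appeal.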

An intuitive AI-powered video editing tool: PonderAI 'Cursor for Video Editing'

we built Cursor for video editing pic.twitter.com/eT68hRhFNq
— timothy (@timwangyc) April 14, 2025

PonderAI

https://ponderstudio.ai

Natural Language Video Editing.

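A "Cursor for video editing" has arrived: a tool in the spirit of the AI code editor Cursor, where you edit video by giving instructions in natural language. Driving edits through text input is an interesting approach. If the same kind of operation could be achieved by connecting MCP with Adobe Premiere Pro, production teams would welcome it.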

Finally usable! Adobe's 'Firefly Video Model' is now open for anyone to try

In the evolving landscape of AI, generating high-quality video content has become critical for creators.

Adobe released the new Firefly Video Model that enables the generation of creative, relevant, and high-quality video content.

Find out more:https://t.co/64zWAtNXPc pic.twitter.com/prUwNRrCWw
— Adobe Experience Cloud (@AdobeExpCloud) April 15, 2025

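Firefly Video Model, the video version of Adobe's generative AI Firefly, has been released as a public beta that anyone can use. Although still in beta, it is drawing attention for its fast processing and expressive output. Integration with Photoshop and Premiere is expected down the line, making this a project to look forward to.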

Expressive, audio-synced avatar generation: Alibaba is developing 'FantasyTalking'

https://fantasy-amap.github.io/fantasy-talking/

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

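FantasyTalking, under development at Alibaba, is a talking-head framework notable for natural mouth and gaze movement. Following Tencent's ACTalker, announced last week, major Chinese tech companies continue to make progress in this area. A model release is also planned for FantasyTalking, so it is worth watching.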

3D

This is handy! 'HoloPart' automatically splits 3D models into parts

HoloPart: Generative 3D Part Amodal Segmentation

https://vast-ai-research.github.io/HoloPart/

generative 3D part amodal segmentation--decomposing a 3D shape into complete, semantically meaningful parts.

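HoloPart is a framework that automatically splits a 3D model into parts. I recently put together a 3D generation workflow with Hunyuan, so I would like to use this model as the final step and take it all the way through part separation. I'm hoping for ComfyUI support before long.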

Tencent launches a web service for 'Hunyuan 3D'! Open-source model also available

Create your own exclusive 3D model in just 5 steps! 🌟
1️⃣ Log in to the Hunyuan 3D website: https://t.co/0GDWwAuvRm
2️⃣ Head to the 'Laboratory' page 🧪
3️⃣ Select the 'Sketch to 3D' feature ✏️
4️⃣ Upload your sketch image and add a text description—describe the object, its color,… pic.twitter.com/AZMFIBH6We
— Hunyuan (@TencentHunyuan) April 10, 2025

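Tencent's 3D generation model Hunyuan 3D has not only been released as open source but is now also available as a web service anyone can use. It's a big plus that even users who have trouble running it locally can use it easily.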

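▼ Supervised by

takio koizumi
Digital artist. Studied 3DCG at Digital Hollywood University. After completing graduate school, worked for about 10 years as a VFX artist on a wide range of productions spanning film, animation, and games. More recently, well versed in AI, researching and sharing workflows that incorporate generative AI.
Website: https://sites.google.com/view/takio-koizumi/link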
