【AI News 2025.05.07】画像/動画/3D/音楽

2025.05.07

画像関連

Midjourneyの新機能『Omni‑Reference』、人物も小物も一貫描写が可能に

Omni-Reference is now live! It's a new way to say "put THIS in my image" for characters, objects, vehicles, and more. Check out the examples or see the full info dump below 👇 pic.twitter.com/NH4PmSIT2A
— Midjourney (@midjourney) May 1, 2025

Midjourneyが、任意の写真やイラストを指定すると複数の生成結果で同じ要素をブレずに描けるOmni‑Reference機能を公開しました。「これを画像に入れて」と指示できて、人物・小物・キャラクターまで一貫性を保ちながら生成ができるというもの。汎用性も高くて重宝しそうです。

Freepikの生成AIモデル『Flux 1.0』が利用料75％オフ

Flux 1.0 now costs 75% less

Create 4X more AI images, without spending extra

It’s not a limited deal, it’s here to stay 💥 pic.twitter.com/ihVAWHBDZv
— Freepik (@freepik) May 6, 2025

Freepikは、Flux 1.0のクレジット消費を75％削減し、同じ料金内で4倍の画像が生成できるようにしたと発表しました。新しいAIサービス・モデルの対応の速さの戦いに加えて、コスト面での戦いも始まりましたね。Freepikならコスト面での戦いにも対応できる体力はあると思うので、他のAI統合サービスがどう対抗するのか楽しみです。

『Recraft』にスタイルミックスの新UI登場！

Introducing Advanced Style Controls in Recraft

✨ Explore ready-to-use styles in the new Infinite library
🎛️ Mix them with your own images
🔁 Save + share styles to stay consistent across everything

No need to start from scratch or fight with prompts every time — your brand’s… pic.twitter.com/HehZaLwQPK
— Recraft (@recraftai) May 5, 2025

画像生成ツールRecraftに、複数スタイルを混ぜる際の割合を調整できる新機能が追加されました。特に注目すべきは、スタイルのブレンド比率を視覚的に操作できる洗練されたUI設計。生成結果をより細かくコントロールしたい時には、非常に魅力的な機能です。ComfyUIでも再現してみたいと思います。

『Ideogram 3.0』リリース！フォトリアルな質感と汎用性がさらに向上

Today, we're releasing a major upgrade to Ideogram 3.0: enhanced realism, more versatile styles, improved prompt following, and greater diversity.

You can now use Magic Fill and Extend with 3.0 in Ideogram Canvas to edit both uploaded and generated images.

Ideogram 3.0 is… pic.twitter.com/QqlsF18aXN
— Ideogram (@ideogram_ai) May 1, 2025

画像生成AIとして高い汎用性と精度を誇るIdeogramが、バージョン3.0へとアップデートされました。今回のアップデートでは、タイポグラフィを含むデザイン表現だけでなく、フォトリアルな質感の再現力も大幅に向上しています。

動画関連

『LTXVideo 13B 0.9.7』公開！高速化＆ComfyUI対応

Today we’re introducing our latest open source video model—and it’s a big one.

This release sets a new bar for speed, quality, and control. It’s faster than anything in its class, packed with new features, and ready to run on your own hardware.

Let’s break it down 👇

1/7 pic.twitter.com/nwvh2ioy2x
— LTX Studio (@LTXStudio) May 6, 2025

GitHub

GitHub - Lightricks/ComfyUI-LTXVideo: LTX-Video Support for ComfyUI

https://github.com/Lightricks/ComfyUI-LTXVideo

LTX-Video Support for ComfyUI. Contribute to Lightricks/ComfyUI-LTXVideo development by creating an account on GitHub.

動画生成モデルLTXVideoの最新版13B 0.9.7がリリースされました。NVIDIAのRTX 4090や5090で動作可能な量子化モデルに対応し、生成速度が向上。さらに、ComfyUI向けのサンプルワークフローも同時に公開されたことで、環境構築や検証がスムーズに行えます。Wan2.1とあわせて比較検証する価値も高そうです。

Higgsfieldが公開した10種類の新しいエフェクト

Introducing Higgsfield Effects – a cinematic pack of high-impact VFX.

10 new effects like Thunder God, Invisible, Turning Metal, Set On Fire and more.

An entire VFX team, packed into a single prompt. pic.twitter.com/QhcWAhQsSF
— Higgsfield AI 🧩 (@higgsfield_ai) May 5, 2025

動画生成AIを手がけるHiggsfield AIが公開したHiggsfield Effectsのデモ映像が公開されています。人物の表情や動きに自然な変化を加えるそのエフェクトは、滑らかさとリアルさの両立が印象的です。Higgsfieldのエフェクトはクオリティが高いですね。

3Dモデルを活用した動画生成が可能になったRunway『Gen-4』

Using 3D assets with Gen-4 References is a simple way to bring highly detailed and specific models into your generative workflows for even more consistency and control. To do so, simply provide a location plate, a quick comp of your 3D model in that space and a style reference. pic.twitter.com/gavfhX4Nrb
— Runway (@runwayml) May 5, 2025

With Gen-4 References you can use 3D models to quickly position objects exactly where you want them. With just three images: a location, a 3D asset and a style reference, you can generate highly refined outputs without any rigging, texturing or rendering. pic.twitter.com/NQtPXGhKJd
— Runway (@runwayml) May 6, 2025

Runwayの最新モデルGen-4で、3Dモデルを参照に使った動画生成が可能になりました。動きや視点の再現性が高く、理想的な映像制作のワークフローじゃないでしょうか。Wan2.1を活用したこのモデルに近いワークフローがそろそろ完成するので比較検証したいと思います。

3D関連

『Meshy』が名画を3Dで再現！幻想的な世界が映像で蘇る

🌠Classic paintings just woke up in 3D.

Most 3D models in this video were generated with Meshy.

Want to see how 3D artist @MedelFPV 👨‍🎨 turns iconic art into living worlds? Check the thread! 🧵#CGI #VFX #animation #digitalart #Monet #Vermeer pic.twitter.com/ucJOC8mzW8
— MeshyAI (@MeshyAI) May 5, 2025

Full workflow breakdown:
– ChatGPT: image generation
– Photoshop & Krea: image manipulation
– Meshy: 3D model generation (Check assets below👇)
– Blender: modeling, texturing, scene creation, lighting, rigging, animation
– After Effects: compositing
– Premiere: music &… pic.twitter.com/gJe26Bd1Ne
— MeshyAI (@MeshyAI) May 5, 2025

3D生成AIのMeshyが、古典的な名画を立体化するプロジェクトを公開しました。映像内の3Dモデルの多くはMeshyで作られており、幻想画家ベクシンスキーのような世界観がそのまま動き出したような体験に。作品のメイキング映像も公開されており、生成プロセスに興味のある方にもおすすめです。

画像1枚から高精度な3Dシーンを生成する『CAST』

sites.google.com

CAST

https://sites.google.com/view/cast4

Abstract Recovering high-quality 3D scenes from a single RGB image is a challenging task in computer graphics. Current methods often struggle with domain-specific limitations or low-quality object generation. To address these, we propose CAST (Component-Aligned 3D Scene Reconstruction from a

Deemosが発表した新技術CAST（Component-Aligned 3D Scene Reconstruction）は、RGB画像1枚から高品質な3Dシーンを構築できる新しいフレームワーク。CASTの登場により、AIによる3D生成環境が一気に進化する可能性がありますね。

音楽関連

商用利用OK＆日本語対応の音楽生成モデル『ACE-Step』

We’re excited to release ACE-Step / ACE-Step-v1-3.5B, a fast, versatile DiT-based foundation model for music generation that runs on consumer-grade GPUs.

With its simple architecture and low hardware requirements, it’s easy to fine-tune for various music tasks, empowering, not… pic.twitter.com/ATm78xTafy
— ACE Studio (@ACEStudio_en) May 6, 2025

高品質な音楽生成モデルACE-Step / ACE-Step-v1-3.5Bがオープンソースで公開されました。Apache 2.0ライセンスで商用利用も可能。LoRA対応や一般的なGPUでの動作、さらには日本語対応と、使いやすさと汎用性の両立が魅力です。音楽生成の民主化を後押しするモデルとして、今後活用の幅が広がりそうです。

『Suno v4.5』が驚異の進化！AI音楽生成の完成度がさらに次の次元へ

v4.5 just dropped for Pro & Premier subscribers 🔥 A wider range of genres, richer vocals, & enhanced prompt understanding for songs that match your vision.

What’s New:
🙌 Expanded genres & smarter mashups: More genre options — Blends like midwest emo + neosoul or EDM + folk… pic.twitter.com/MGp3zH8VHd
— Suno (@SunoMusic) May 1, 2025

AI音楽生成で注目を集めるSunoが、Pro および Premier 加入者向けに、最新バージョンv4.5をリリースしました。楽曲の構成、音質、表現力のすべてが飛躍的に向上しており、人間が制作したと錯覚するほどの完成度です。凄すぎますね。

▼この記事の監修

takio koizumi
デジタルアーティスト。デジタルハリウッド大学で3DCGを学ぶ。大学院修了後、VFXアーティストとして約10年間、映画・アニメ・ゲームなど多彩なジャンルの作品を手がける。近年はAIに精通し、生成AI技術を取り入れたワークフローを研究し発信している。
HP： https://sites.google.com/view/takio-koizumi/link

NEWS

【AI News 2025.05.07】画像/動画/3D/音楽

画像関連

Midjourneyの新機能『Omni‑Reference』、人物も小物も一貫描写が可能に

Freepikの生成AIモデル『Flux 1.0』が利用料75％オフ

『Recraft』にスタイルミックスの新UI登場！

『Ideogram 3.0』リリース！フォトリアルな質感と汎用性がさらに向上

動画関連

『LTXVideo 13B 0.9.7』公開！高速化＆ComfyUI対応

Higgsfieldが公開した10種類の新しいエフェクト

3Dモデルを活用した動画生成が可能になったRunway『Gen-4』

3D関連

『Meshy』が名画を3Dで再現！幻想的な世界が映像で蘇る

画像1枚から高精度な3Dシーンを生成する『CAST』

音楽関連

商用利用OK＆日本語対応の音楽生成モデル『ACE-Step』

『Suno v4.5』が驚異の進化！AI音楽生成の完成度がさらに次の次元へ

関連記事一覧

【AI News 2025.04.02】今週のおすすめ記事4選

【AI News 2025.04.09】おすすめ記事/その他ニュース

【AI News 2025.05.07】おすすめ記事4選

【AI News 2025.04.30】3D/音楽/音声

【AI News 2025.04.02】3D/ComfyUI/その他

【AI News 2025.04.23】おすすめ記事5選

【AI News 2025.05.07】LLM/ComfyUI/その他ニュース

【AI News 2025.02.26】LLM/ComfyUI

【AI News 2025.05.07】画像/動画/3D/音楽

画像関連

Midjourneyの新機能『Omni‑Reference』、人物も小物も一貫描写が可能に

Freepikの生成AIモデル『Flux 1.0』が利用料75％オフ

『Recraft』にスタイルミックスの新UI登場！

『Ideogram 3.0』リリース！フォトリアルな質感と汎用性がさらに向上

動画関連

『LTXVideo 13B 0.9.7』公開！高速化＆ComfyUI対応

Higgsfieldが公開した10種類の新しいエフェクト

3Dモデルを活用した動画生成が可能になったRunway『Gen-4』

3D関連

『Meshy』が名画を3Dで再現！幻想的な世界が映像で蘇る

画像1枚から高精度な3Dシーンを生成する『CAST』

音楽関連

商用利用OK＆日本語対応の音楽生成モデル『ACE-Step』

『Suno v4.5』が驚異の進化！AI音楽生成の完成度がさらに次の次元へ

関連記事一覧

【AI News 2025.04.02】今週のおすすめ記事4選

【AI News 2025.04.09】おすすめ記事/その他ニュース

【AI News 2025.05.07】おすすめ記事4選

【AI News 2025.04.30】3D/音楽/音声

【AI News 2025.04.02】3D/ComfyUI/その他

【AI News 2025.04.23】おすすめ記事5選

【AI News 2025.05.07】LLM/ComfyUI/その他ニュース

【AI News 2025.02.26】LLM/ComfyUI

Freepikの生成AIモデル『Flux 1.0』が利用料75％オフ