- Ainsider
- Posts
- Ainsider #22: Insane AI developents
Ainsider #22: Insane AI developents
Adobe Premiere with Gen AI | Llama3 | Stable Diffusion 3 | Latest AI Tools
Ainsider #22
Adobe is changing Video production forever
Huge AI developments in last week - Meta, Microsoft, Stability AI, Grok, Krea
Latest AI Tools i found
Tremendous update of Adobe Premiere Pro - with GenAI
Adobe showed demo of their new Adobe Firefly video model - inside of Premiere Pro App 📹
What's even more exciting here - you'll be able to include third party Video platforms integration like Runwayml, Pika Labs, and OpenAI’s Sora 🤯
Adobe showed demo of their new Adobe Firefly video model - inside of Premiere Pro App 📹
What's even more exciting here - you'll be able to include third party Video platforms integration like Runwayml, Pika Labs, and OpenAI’s Sora 🤯
Video creating & editing will be different… twitter.com/i/web/status/1…
— Piotr Macai (@piotrmacai)
4:38 PM • Apr 15, 2024
Stability AI released Stable Diffusion 3 model
Here's everything you need to know & key features of the Stable Diffusion 3 model 🧵👇
✅Improved Text Generation
Stable Diffusion 3 has significantly better text rendering capabilities compared to previous versions, allowing it to generate long sentences with accurate font styles and spacing.
It outperforms state-of-the-art text-to-image models such as DALL·E 3, Midjourney v6, and Ideogram v1 in terms of typography and prompt adherence, confirmed by human preference evaluations.
✅Enhanced Prompt Following
The model uses highly accurate image captions in training to improve its ability to follow prompts, making it as good as or better than DALL-E 3 in this regard.
✅Transformer-based Architecture
Stable Diffusion 3 utilizes a new Multimodal Diffusion Transformer (MMDiT) architecture, which is more efficient and scalable than previous models while also producing higher-quality images.
✅Rectified Flow Sampling
The model employs a rectified flow technique that simplifies the generative process, allowing for faster and more cost-effective image creation.
✅Availability and Pricing
Stable Diffusion 3 is currently available through the Stability AI Developer Platform API, with plans to make the model weights available for self-hosting with a Stability AI Membership in the near future.
The model is not free, but the membership fees are reported to be quite modest for small businesses.
Stable Diffusion 3 represents a significant advancement in text-to-image generation, especially with their API - which is a great starter for building own AI image models.
Meta Llama 3 model released
Everything you need to know 👇
✅8 billion and 70 billion parameter pre-trained and fine-tuned models available openly
✅Trained on over 15 trillion tokens, 7 times larger than the dataset used for Llama 2
✅Improved tokenizer with a vocabulary of 128,000 tokens for better performance
✅State-of-the-art performance across industry benchmarks
✅New capabilities such as enhanced reasoning and coding
✅3 times more efficient training compared to Llama 2
✅New trust and safety tools including Llama Guard 2, Code Shield, and CyberSec Eval 2
✅Integration into Meta AI
✅Availability in more countries across the Meta AI Apps
New Multimodal very powerful LLM
It's significant advancement for artificial intelligence 👀
Reka has introduced Reka Core - a multimodal LLM that is setting new benchmarks in the field 👀
Designed to process and understand a wide array of inputs including:
- text,
- images,
- audio,
- video
What's even more interesting - in recent evaluations Reka Core has
- outshone Claude 3 Opus in multimodal human evaluations,
- surpassed Gemini Ultra in video understanding,
- demonstrated competitive prowess with GPT-4 in image comprehension.
This positions Reka Core as one of the most capable LLMs available today 👀
Mulitmodal Grok 1.5V
Grok after update can process a wide variety of visual information - including documents, diagrams, charts, screenshots, and photographs.
Grok-1.5V will be available soon to our early testers and existing Grok users.
More details: https://x.ai/blog/grok-1.5v
Krea is introducing Video AI
Krea is introducing AI Video 📹👀
— Piotr Macai (@piotrmacai)
3:25 PM • Apr 19, 2024
Ultrarealistic Face Image to Video tool
Microsoft Research just released VASA-1 model, which can generate ultra-realistic talkative video with lip-syncing only from images.
See how insane it is:
We're entering into era the first time in human history when we can't believe our eyes 👀
Microsoft Research just released VASA-1 model, which can generate ultra-realistic talkative video with lip-syncing only from images 🤯
Below the best examples 👇
— Piotr Macai (@piotrmacai)
5:00 PM • Apr 18, 2024
Last powerful AI Tools i found
Kaggle
Kaggle is the world’s largest data science community. It offers a vast repository of community-edited machine learning models, datasets, and code.
As a data scientist or machine learning enthusiast, you can participate in competitions, collaborate on projects, and access valuable resources.
Noislee
Noislee turns sounds and melodies into music videos.
Create captivating music videos by transforming audio content into visual experiences.
Madgicx
Madgicx is an AI-powered ads marketing platform.
Optimize your advertising campaigns using AI algorithms for better targeting and performance.
Fotoram is a versatile photo editor and collage maker.
Enhance photos with basic tools (resize, crop, rotate) or explore advanced features like filters, effects, frames, and text.
PixiteAI
PixelMe! is a generative AI platform for fashion and e-commerce stores.
Use AI-generated content for marketing, especially in the fashion industry.
Udio.
Udio is very powerful AI music generation app.
Create personalized music compositions using AI algorithms.
BasedLabs.
BasedLabs. offers text-to-video and image art platforms.
Convert text-based content into engaging videos and images.