Ainsider vol.39 Newsletter

Everything you need to know from #AI space

ChatGPT ‘Canvas’ feature

OpenAI GPT Advanced Voice Assistant

Meta Movie Video Gen

PikaLabs introduced 1.5 Model

Leonardo launched AI image editing

The only one AI Toolkit you need - 50 AI Tools from 10 categories

ChatGPT ‘Canvas’ feature

OpenAI has recently launched an innovative feature called ChatGPT Canvas, which significantly enhances how users interact with the AI for writing and coding tasks.

This new interface allows for real-time editing and collaboration, enabling users to modify specific sections of text or code generated by ChatGPT, making it a powerful tool for both writers and developers.

Overview of ChatGPT Canvas
The Canvas feature introduces a side-by-side interface that operates alongside the traditional chat window. This setup allows users to directly edit text or code while receiving suggestions from ChatGPT in real time.
This marks a substantial shift from the previous model, where users had to input multiple prompts to refine outputs.
With Canvas, users can highlight particular sections of their work, request specific changes, and receive inline feedback, effectively transforming ChatGPT into a collaborative partner rather than just a passive tool.

Key Features of Canvas

  • Inline Editing: Users can edit specific parts of the generated text or code directly within the Canvas interface. This includes making adjustments to tone, length, and clarity, as well as adding emojis where appropriate.

  • Contextual Understanding: The AI is designed to better understand user objectives, allowing it to provide more relevant suggestions based on highlighted sections of text or code.

  • Shortcuts for Efficiency: Canvas includes a menu of shortcuts that streamline common tasks:

    • Suggest Edits: Provides inline suggestions for improving writing.

    • Adjust Length: Shortens or expands the output as needed.

    • Change Reading Level: Alters the complexity of the text from kindergarten to graduate level.

    • Debugging Tools: For coding tasks, users can quickly access functions like reviewing code, adding comments, fixing bugs, and translating code between languages such as Python and JavaScript.

  • Software and Complete Code Generation: Soon, ChatGPT Canvas will expand its capabilities to include complete software applications and code generation similar to what is offered by Claude AI’s Artifacts. This will allow users to not only edit existing code but also create comprehensive applications from scratch with ease.

  • Version Control: A back button allows users to revert to previous versions of their work, making it easy to track changes and explore different iterations of their projects.

Accessing ChatGPT CanvasCurrently in beta, the Canvas feature is available to ChatGPT Plus and Team subscribers.

You can find all details at OpenAI Blog article: https://openai.com/index/introducing-canvas/

OpenAI GPT Advanced Voice Assistant

OpenAI's New Voice Assistant: A Leap Forward in AI Interaction

OpenAI has recently unveiled its latest advancement in artificial intelligence with the introduction of GPT-4o, a model that significantly enhances the capabilities of voice interaction. This new voice assistant is designed to offer a more natural and intuitive experience, allowing users to engage with AI in a way that closely mimics human conversation.

Key Features of GPT-4o

Natural Speech Interaction
Unlike previous iterations that required a cumbersome process of transcribing speech to text, GPT-4o can operate directly in speech. This allows for faster responses and the ability to recognize nuances such as tone and emotion. Users can interrupt the AI mid-sentence, making the conversation feel more fluid and dynamic

.Advanced Emotional Recognition
The assistant is capable of interpreting emotional cues based on the user's tone of voice. This feature enables it to adjust its responses accordingly, enhancing the overall interaction experience. The ability to convey emotion, change tone, and even integrate sound effects makes the assistant remarkably lifelike.

Multimodal Capabilities
GPT-4o is not limited to voice; it also supports visual inputs and can engage in real-time language translation. This multimodal capability positions it as a versatile tool for various applications, from casual conversations to professional tasks

Realtime Voice API

OpenAI with latest updates introduced a Realtime API

It allows developers to integrate #AI voice assistants or conversational agents into their apps, supporting multimodal interactions with both text and audio inputs and outputs

PikaLabs launched 1.5 model

Pika Labs released their new Pika 1.5 model 👀

More realistic movement

Big screen shots

New features and effects called 'Pikaffects'

Leonardo launched AI image editing

With Leonardo AI you can edit Images with prompts

Like generative fill but from text prompts

The only one AI Toolkit you need - 50 AI Tools from 10 categories

LLM / Text:📝
- ChatGPT
- ClaudeAI
- Google Gemini
- Perplexity
- You.com
- Microsoft Copilot
-Hugginface Chat

🗣Chatbots:
- Botpress
- Voiceflow
- Air
- Dante AI
- ChatBase
-Flowise

📸 Images:
- Midjourney
- Flux.1
- Leonardo
- Adobe Firefly

🎥 Videos:
- Runway
- KlingAI
- Hotshot
- LumaLabs
- Vidu AI

🤳🏻Social Media:
- Simplified AI
- Sprout Social
- Emplifi
- Lately
- Brandwatch

💰Marketing:
- AdCreative
- Claude
- Jasper
- SEMrush

🔎Research:
- HARPA
- Perplexity
- Glasp

🎭 Presentations:
- Tome
- Simplified
- Beautiful.ai
- Synthesia

🧑🏻‍💻Copywriting:

- Grammarly

- Wordtune

- ChatGPT Plus

- WriteSonic

🔊Audio:

- Murf

- Eleven Labs

- Otter

- Speechity

🎨 Design:

- Canva

- Adobe Firefly

- Designs ai

- Mokker

-Microsoft Designer

🧠 Productivity:

- ChatGPT Plus

- Taskade

- Audio Pen

- Notion AI