ADSP Logo

The AI Canvas Newsletter #1

Explore AI innovations like InstructDiffusion, Stable Audio, and Microsoft's phi-1.5.

ai canva 1

Welcome to The AI Canvas Newsletter Series

Our newsletter series, The AI Canvas, is designed to keep you informed about the latest advancements and innovations in the world of artificial intelligence. Each edition brings you key updates, insightful reports, and transformative projects from leading researchers and companies. Stay connected with us to discover how AI is reshaping industries, enhancing creativity, and driving the future of technology.

This Week in AI: Innovations, Reports, and Features

🖼️ Discover InstructDiffusion: Unifying diverse vision tasks into intuitive, human-guided processes.

🎵 Tune into Stable Audio: Revolutionising music generation with text metadata, duration, and start time.

📚 Learn about Phi-1.5: Microsoft's compact model that's making big strides in natural language tasks.

InstructDiffusion: Unifying Vision Tasks with Human Instructions

In the pursuit of artificial general intelligence in computer vision, a ground-breaking framework has emerged - InstructDiffusion. This innovative approach transforms diverse computer vision tasks into intuitive image-manipulation processes guided by human instructions. Unlike previous methods that struggled to unify tasks due to varying output formats, InstructDiffusion harnesses the power of denoising diffusion models and treats all tasks as instructional image editing. It excels in handling output formats like RGB images, binary masks, and key points, effectively covering a wide range of vision tasks.

Read More

Instruct diffusion

Stable Audio: Revolutionising Music Generation with Latent Diffusion Models

Step into the world of cutting-edge generative AI with Stability AI's Harmonai lab and their latest creation, "Stable Audio." While diffusion models have revolutionised image and video generation, music has posed unique challenges due to fixed-length constraints. Stable Audio changes the game by allowing audio generation to be guided by text metadata, audio duration, and start time. This approach offers creative control over both content and length, making it perfect for crafting complete songs. With speedy inference and a commitment to open-source accessibility, this 907M parameter U-Net model represents a significant leap in music generation technology.

If you want to learn more, you can read the full summary here.

newsletter1

Unlocking the Potential of Compact Language Models: Introducing phi-1.5

Microsoft Research continues to push the boundaries of smaller Transformer-based language models in their latest endeavour, "Textbooks Are All You Need II: phi-1.5 technical report." Building on the success of previous models like TinyStories and phi-1, this report explores the concept of common sense reasoning in natural language. Phi-1.5, a 1.3 billion parameter model, outshines models five times its size in natural language tasks, including grade-school mathematics and basic coding.

Find out more here.

phi 1.5

Technical Reads & Projects

Evaluation and Hallucination Detection for Abstractive Summaries

"Exploring the complexities of evaluating abstractive summaries, this article discusses various dimensions and metrics, including reference-based, context-based, preference-based, and sampling-based approaches, while also delving into methods for detecting hallucination, such as natural language inference and question-answering."

A New Age of Magic

"In the world of advanced technology and prompt engineering, 'Any sufficiently advanced technology is indistinguishable from magic.' Explore the parallels between coding and casting spells in this journey into the new age of AI-powered creativity and innovation."

Illusion Diffusion

"Given a prompt and your pattern, we use a QR code conditioned controlnet to create a stunning illusion!"

Centaurs and Cyborgs on the Jagged Frontier

"A lot of people have been asking if AI is really a big deal for the future of work. We have a new paper that strongly suggests the answer is YES."

Journey Through the AI Canvas Podcast

Dive into 'The AI Canvas', our podcast exploring the transformative potential of generative AI. Engage in fireside chats, case studies, and innovative discussions on AI’s impact on industries and creativity.

Latest Episode of the AI Canvas

The AI Canvas - Generative AI in the Classroom: The Future of Learning with Francisco Recalde

In this enlightening episode of the AI Canvas podcast, host David Foster sits down with Francisco Recalde, Head of the Department of Languages at Dixon's Unity Academy, to explore the transformative effects of AI on education. They discuss AI’s potential in teaching and learning, the fear of AI replacing teachers, and the role of AI as a guide for students.

David Foster Headshot

David Foster

Founding Partner, ADSP

The AI Canvas