The AI Canvas Newsletter #1
Explore AI innovations like InstructDiffusion, Stable Audio, and Microsoft's phi-1.5.
Welcome to The AI Canvas Newsletter Series
Our newsletter series, The AI Canvas, is designed to keep you informed about the latest advancements and innovations in the world of artificial intelligence. Each edition brings you key updates, insightful reports, and transformative projects from leading researchers and companies. Stay connected with us to discover how AI is reshaping industries, enhancing creativity, and driving the future of technology.
This Week in AI: Innovations, Reports, and Features
🖼️ Discover InstructDiffusion: Unifying diverse vision tasks into intuitive, human-guided processes.
🎵 Tune into Stable Audio: Revolutionising music generation with text metadata, duration, and start time.
📚 Learn about phi-1.5: Microsoft's compact model that's making big strides in natural language tasks.
InstructDiffusion: Unifying Vision Tasks with Human Instructions
In the pursuit of artificial general intelligence in computer vision, a ground-breaking framework has emerged: InstructDiffusion. This approach recasts diverse computer vision tasks as intuitive image-manipulation processes guided by human instructions. Unlike previous methods that struggled to unify tasks due to varying output formats, InstructDiffusion builds on denoising diffusion models and treats every task as instructional image editing. It handles output formats such as RGB images, binary masks, and keypoints, which lets it cover a wide range of vision tasks.
Stable Audio: Revolutionising Music Generation with Latent Diffusion Models
Step into the world of cutting-edge generative AI with Stability AI's Harmonai lab and their latest creation, "Stable Audio." While diffusion models have revolutionised image and video generation, music has posed a particular challenge because audio diffusion models are typically trained to produce output of a fixed length. Stable Audio changes the game by allowing audio generation to be guided by text metadata, audio duration, and start time. This approach offers creative control over both content and length, making it well suited to crafting complete songs. With speedy inference and a commitment to open-source accessibility, this 907M parameter U-Net model represents a significant leap in music generation technology.
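To make the duration and start-time conditioning more concrete, here is a minimal Python sketch of how such timing values could be derived. It is an illustration under stated assumptions, not Stability AI's implementation: the names `TimingConditioning`, `seconds_start`, `seconds_total`, `sample_training_window` and `inference_conditioning` are hypothetical, and the window and track lengths are arbitrary example values.

```python
import random
from dataclasses import dataclass


@dataclass
class TimingConditioning:
    """Hypothetical container for the two timing signals described above."""
    seconds_start: float  # where the audio window begins within the source track
    seconds_total: float  # total length of the source track (or requested clip)


def sample_training_window(track_seconds: float,
                           window_seconds: float,
                           rng: random.Random) -> TimingConditioning:
    """Pick a random fixed-length window from a track and record its timing."""
    # The window cannot start later than (track length - window length).
    latest_start = max(0.0, track_seconds - window_seconds)
    start = rng.uniform(0.0, latest_start)
    return TimingConditioning(seconds_start=start, seconds_total=track_seconds)


def inference_conditioning(requested_seconds: float) -> TimingConditioning:
    """At generation time, request a clip that starts at 0 with the desired length."""
    return TimingConditioning(seconds_start=0.0, seconds_total=requested_seconds)


if __name__ == "__main__":
    rng = random.Random(0)
    # Example values only: a 3-minute track and an arbitrary 95-second window.
    print(sample_training_window(180.0, 95.0, rng))
    print(inference_conditioning(30.0))
```

The idea the sketch tries to capture is that a model which only ever sees fixed-length windows can still learn where each window sits within a longer piece, so a user can later ask for a clip of a chosen length.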
If you want to learn more, you can read the full summary here.
Unlocking the Potential of Compact Language Models: Introducing phi-1.5
Microsoft Research continues to push the boundaries of smaller Transformer-based language models in their latest endeavour, "Textbooks Are All You Need II: phi-1.5 technical report." Building on earlier work such as TinyStories and phi-1, this report focuses on common sense reasoning in natural language. The resulting model, phi-1.5, has 1.3 billion parameters and is reported to perform comparably to models five times its size on natural language tasks, while surpassing most non-frontier LLMs on more complex reasoning tasks such as grade-school mathematics and basic coding.
Find out more here.
Technical Reads & Projects
Evaluation and Hallucination Detection for Abstractive Summaries
"Exploring the complexities of evaluating abstractive summaries, this article discusses various dimensions and metrics, including reference-based, context-based, preference-based, and sampling-based approaches, while also delving into methods for detecting hallucination, such as natural language inference and question-answering."
A New Age of Magic
"In the world of advanced technology and prompt engineering, 'Any sufficiently advanced technology is indistinguishable from magic.' Explore the parallels between coding and casting spells in this journey into the new age of AI-powered creativity and innovation."
Illusion Diffusion
"Given a prompt and your pattern, we use a QR code conditioned controlnet to create a stunning illusion!"
Centaurs and Cyborgs on the Jagged Frontier
"A lot of people have been asking if AI is really a big deal for the future of work. We have a new paper that strongly suggests the answer is YES."
Journey Through the AI Canvas Podcast
Latest Episode of the AI Canvas
The AI Canvas - Generative AI in the Classroom: The Future of Learning with Francisco Recalde
In this enlightening episode of the AI Canvas podcast, host David Foster sits down with Francisco Recalde, Head of the Department of Languages at Dixons Unity Academy, to explore the transformative effects of AI on education. They discuss AI's potential in teaching and learning, the fear of AI replacing teachers, and the role of AI as a guide for students.
David Foster
Founding Partner, ADSP