AIGuys Digest | Sept 2024
🌟 Welcome to the AIGuys Digest Newsletter, where we cover State-of-the-Art AI breakthroughs and all the major AI news🚀.
Don’t forget to check my new book on AI, it covers a lot of AI optimizations and hands-on code:
🔍 Inside this Issue:
- 🤖 Latest Breakthroughs: This month it’s all about OpenAI’s o1, METAs Segment Anything Model, Geometric Deep Learning Introduction, and Latest Developments in Music Generation.
- 🌐 AI Monthly News: Discover how these stories are revolutionizing industries and impacting everyday life: OpenAI o1 model reasoning capabilities, Meta’s latest augmented reality glasses, and New drama at OpenAI.
- 📚 Editor’s Special: This covers the interesting talks, lectures, and articles we came across recently.
Let’s embark on this journey of discovery together! 🚀🤖🌟
Follow me on Twitter and LinkedIn at RealAIGuys and AIGuysEditor.
Latest Breakthroughs
The biggest breakthrough of the last month has to be the release of the o1 model from OpenAI. Even though it is a closed-source model. We were able to put a good piece together delving deep into its possible architecture. Is it really smarter than a PhD student or is that just hype? Can it really think so before it answers? The answer is both yes and no. Read the full article here.
What Is Going On Inside OpenAIs Strawberry (o1)?
Even with state-of-the-art annotation tools, the complexity of annotating complex images limits human annotators to a mere 20 images per hour.
META’s Segment Anything Model (SAM) presents a groundbreaking method to significantly accelerate the annotation for a vast array of objects. Now you can annotate objects using just with text commands. How cool is that? Take a deep dive into how Meta did this amazing stuff.
METAs Segment Anything Model (SAM) Complete Breakdown
Do you want to know why Deep Learning works so well, what are its mathematical underpinnings? Then look no further than Symmetry.
Geometric Deep Learning unifies a broad class of ML problems from the perspectives of symmetry and invariance. These principles not only underlie the breakthrough performance of convolutional neural networks and the recent success of graph neural networks but also provide a principled way to construct new types of problem-specific inductive biases.
Geometric Deep Learning Introduction
Lately, the entire AI community feels like AI agents and LLMs are the only things happening in AI. But that’s not true, it is sad that other cool ideas do not get as much attention as they should. So, today we are going to dive deep into music generation and look into FluxMusic.
The reason I want you to read this blog is that people in AI should be exposed to new ideas, outside of LLMs, I feel somehow a lot of AI engineers just don’t know enough tricks and rely too much on API calls and copying code from HuggingFace.
Latest Developments In Music Generation
AI Monthly News
OpenAI releases o1, its first model with ‘reasoning’ abilities
ChatGPT Plus and Team users get access to both o1-preview and o1-mini starting today, while Enterprise and Edu users will get access early next week. OpenAI says it plans to bring o1-mini access to all the free users of ChatGPT but hasn’t set a release date yet. Developer access to o1 is really expensive: In the API, o1-preview is $15 per 1 million input tokens, or chunks of text parsed by the model and $60 per 1 million output tokens. For comparison, GPT-4o costs $5 per 1 million input tokens and $15 per 1 million output tokens.
News article: Click here
o1 Model Card: Click here
Introducing Orion, METAs First True Augmented Reality Glasses
Meta recently announced a new version of its Ray-Ban smart glasses, integrating advanced AI features. These glasses are equipped with custom-designed speakers, directional audio, and a 12 MP camera, enabling high-quality photos and videos. With Meta AI integration, users can interact hands-free through voice commands, livestream directly to social media platforms, and receive real-time feedback or assistance.
The glasses also support voice-activated functionalities, such as answering questions or providing contextual information based on the user’s environment. This new release positions Meta’s AR glasses as a blend of hardware innovation and AI capabilities, offering a more interactive and immersive experience.
News Article: Click here
Meta’s Announcement: Click here
MORE OpenAI drama
According to The Times and others, OpenAI is undergoing a significant transition as it seeks to become more appealing to external investors. This includes a shift towards becoming a for-profit business and potentially raising one of the largest funding rounds in recent history, which could increase its valuation to around $150 billion. Despite this, multiple high ranking employees resigned last week, including Chief Technical Officer Mira Murati, Chief Research Officer Bob McGrew, and VP of Research Barret Zoph. All who departed posted messages statements stating they are resigning to explore new opportunities or take a break, and are totally supportive of OpenAI.
- OpenAI CFO tells investors funding round should close by next week despite executive departures
- As OpenAI CTO and two others depart, Altman denies link to restructuring plans
- Turning OpenAI Into a Real Business Is Tearing It Apart
Editor’s Special
- [EEML'24] Michael Bronstein - Geometric Deep Learning: Click here
- Stanford ECON295/CS323 I 2024 I Business of AI, Reid Hoffman: Click here
- What’s the future for generative AI? — The Turing Lectures with Mike Wooldridge: Click here
- Stanford CS229 I Machine Learning I Building Large Language Models (LLMs): Click here
🤝 Join the Conversation: Your thoughts and insights are valuable to us. Share your perspectives, and let’s build a community where knowledge and ideas flow freely. Follow us on Twitter and LinkedIn at RealAIGuys and AIGuysEditor.
Thank you for being part of the AIGuys community. Together, we’re not just observing the AI revolution; we’re part of it. Until next time, keep pushing the boundaries of what’s possible. 🚀🌟
Your AIGuys Digest Team