Necdet Yasar | Big Tech’s Strategic Moves: AI Advancements and Industry Shifts | From Google’s new AI talent acquisition to OpenAI’s venture into safety and investment, and Nvidia’s chip delays, tech giants are shaping… | 8h ago
Linda Margaret in Brain Labs | When the Money Is Real, but the Product Is Fake | The illusion of AI safety in an unsafe world | Jul 26
Jonathan Davis | Understanding Anthropic’s Golden Gate Claude | Anthropic’s research into monosemanticity can improve language model interpretability and safety | Jun 27
Ali Waseem | GPT-4o System Card By OpenAI | Comprehensive Safety Measures for GPT-4o Deployment | 2d ago
Simone Tedeschi in Generative AI | LLMs: How Safe Are They? | Red Teaming Large Language Models with ALERT | May 22
Vincent Caldeira | MarketMaestro: Building and Aligning a Local AI Stock Advisor Agent with InstructLab, Podman AI Lab… | As AI systems become more integrated into sensitive domains like finance, healthcare, and legal services, ensuring their alignment with… | Aug 4
Ai2 in Ai2 Blog | Open research is the key to unlocking safer AI | The last few years of AI development have shown the power and potential of generative AI. Naturally, these leaps in machine intelligence… | 3d ago
Rachel Draelos, MD, PhD in AI Advances | AI Alignment and Moral Philosophy for Artificial General Intelligence | In this post I summarize a few of my thoughts on alignment and moral philosophy for safe artificial general intelligence. | Mar 19