Data Validation at Scale, Stable Diffusion Fine-Tuning, and How AI Will Affect Data Professionals

ODSC - Open Data Science
ODSCJournal
Published in
Sent as a

Newsletter

4 min readJun 8, 2023

Data Validation at Scale — Detecting and Responding to Data Misbehavior

In this tutorial, we’ll introduce the concept of data logging and discuss how to validate data at scale by creating metric constraints and generating reports based on the data’s statistical profiles using the whylogs open-source package.

Faster Stable Diffusion Fine-Tuning with Ray AIR

Ray AI Runtime (AIR) is a scalable and unified toolkit for ML applications. Here’s how you can use it to scale and accelerate the fine-tuning process of a stable diffusion model.

AI Weather Prediction: How Does it Work?

Weather forecasting is very reliant on computer technology to perform accurately. Here’s a bit more on the science behind AI weather prediction.

Supercharge Your LangChain Workflow with Jina AI’s Inference: Seamless Integration for Powerful Applications

This is how Jina AI’s Inference integrates with LangChain, allowing developers to build cutting-edge applications with ease.

How Will AI Affect the Role of Data Professionals?

Most of the discussion surrounding AI and workers has been around writers, software engineers, admin workers, etc, but not so much about data professionals. Here’s what those in data should think about.

Collateral Damage in the Battle Over Truth

Who is ignored and harmed in our AI-powered battle over truth and evidence? What can we do to address this?

AI Girlfriends and Other Ridiculous Examples of Using Generative AI

From influencers making AI girlfriends to a fake news generator, here are a few weird and ridiculous examples of using generative AI.

Just announced — the ODSC Europe free Virtual Pass! Get access to all virtual talks, the virtual AI Expo and Demo Hall, virtual demo talks, and networking events. Register for free!

Attorney Admits to Using ChatGPT for Case Research

A lawyer out of New York is in hot water after his firm used ChatGPT for legal research. He was “unaware that its content could be false.”

New Statement Signed by the Likes of OpenAI’s Sam Altman Warns of AI’s Extinction Risk

A new statement by the Center for AI Safety, a San Francisco-based not-for-profit, warns of the existential risks associated with AI.

Italy Eyeing State-Backed Fund to Promote AI Startups

In a bid to boost native AI startups, Italy is eyeing a state-backed fund to promote its own market.

White House Looks to New Road Map for Handling Artificial Intelligence

Recently, the Biden administration released a series of documents that it calls a “road map” on the issue of artificial intelligence.

All of the Free Virtual Sessions Coming to ODSC Europe 2023

Looking for some data science content to keep you busy June 14th-15th? You can access all of these sessions for FREE with an ODSC Europe 2023 Virtual Pass.

Generative AI Sessions Coming to ODSC Europe

Covering topics like building Stable Diffusion APIs and Neurosymbolic AI, these are some of the generative AI sessions coming to ODSC Europe next week.

All AI and Machine Learning Solutions Coming to ODSC Europe 2023

Led by industry giants like Microsoft Azure, Taipy, and SAS, these are a few machine learning solutions providers coming to ODSC Europe this June 14th-15th.

Partner Event: MLOps for Gen AI

Tuesday, June 27th

The influx of new tools like ChatGPT sparks the imagination and highlights the importance of Generative AI and foundation models as the basis for modern AI applications. Beyond the hype, operationalizing these large models securely in user-facing production applications is a new and complex MLOps challenge. In this session, we’ll share MLOps best practices based on real Gen AI use cases.

The Most Popular In-Person Sessions from ODSC East 2023

Now that ODSC East is over, these are the top sessions that packed rooms and left attendees buzzing.

ODSC West Call for Speakers

Interested in sharing your thought leadership, expertise, or use cases with the data science community? Learn more about how you can speak and present at ODSC West here!

Video of the Week: Unlocking the Power of Large Language Models

Enjoy this remarkable keynote from MosaicML’s Hagay Lupesko as he discusses the power of Large Language Models and explores why owning your own model is critical — and surprisingly within reach.

Upcoming Webinars:

Overcoming External Data Hurdles and Enriching Predictive Forecasts at Scale

Thu, Jun 22, 2023 12:00 PM — 1:00 PM EDT

Join this interactive session with experts from Ready Signal as they explore strategies to mature your data integration and decision intelligence processes.

How Programmatic Feature Discovery Changes the Data Science Workflow

Thu, Jun 29, 2023 12:00 PM — 1:00 PM EDT

In this talk, we will review automated feature engineering technology and discuss how data scientists can benefit from this technology to transform your data and enable AI applications.

A Path to Insights Starts with Trusted Data: Accelerating Decisions with Third-Party Data in Financial Services

Tue, Jul 18, 2023 12:00 PM — 1:00 PM EDT

Your ability to make confident decisions based on relevant factors relies on accurate data filled with context. That’s why enriching your analysis with trusted, fit-for-use, third-party data is key to ensuring long-term success.

--

--

ODSC - Open Data Science
ODSCJournal

Our passion is bringing thousands of the best and brightest data scientists together under one roof for an incredible learning and networking experience.