Open in app

Sign In

Write

Sign In

Mary Newhauser
Mary Newhauser

647 Followers

Home

About

Published in

Towards Data Science

·Pinned

Understanding ChatGPT Plugins: Benefits, Risks, and Future Developments

Expect improvement, not perfection. — This article was originally published on GPTech. When ChatGPT was first released in late 2022, its capabilities were simultaneously impressive and unimpressive. It could rap battle and write differential equations in LaTeX but didn’t know anything about the war in Ukraine and sometimes couldn’t even do simple math. The stark…

Data Science

13 min read

Understanding ChatGPT Plugins: Benefits, Risks, and Future Developments
Understanding ChatGPT Plugins: Benefits, Risks, and Future Developments
Data Science

13 min read


Published in

Towards Data Science

·Pinned

GPT-4 vs. ChatGPT: An exploration of training, performance, capabilities, and limitations

GPT-4 is an improvement, but temper your expectations. — OpenAI stunned the world when it dropped ChatGPT in late 2022. The new generative language model is expected to totally transform entire industries, including media, education, law, and tech. In short, ChatGPT threatens to disrupt just about everything. And even before we had time to truly envision a post-ChatGPT world…

Data Science

7 min read

GPT-4 vs. ChatGPT: An Exploration of Training, Performance, Capabilities, and Limitations
GPT-4 vs. ChatGPT: An Exploration of Training, Performance, Capabilities, and Limitations
Data Science

7 min read


Published in

Towards Data Science

·Pinned

Making the Jump from Data Analyst to Data Scientist in 2023

The skills and resources you need to transition from a data analyst to data scientist position. — Imposter syndrome, frustration, doubt. These are just a few of the things I experienced while trying to make the jump from data analyst to data scientist in 2018. In the five years since, advances in artificial intelligence have introduced world-changing technologies such as large language and transformer models, diffusion models…

Data Science

12 min read

Making the Jump from Data Analyst to Data Scientist in 2023
Making the Jump from Data Analyst to Data Scientist in 2023
Data Science

12 min read


Published in

Towards Data Science

·Pinned

The ultimate reference for clean Pandas code

A clean way to clean data — Pandas can transform even the messiest data into pristine machine learning datasets. The process itself, though, can be quite messy. Pandas code can be hard to read for a number of reasons. For one thing, there are many different ways of accomplishing the same basic tasks in Pandas. Subsetting data…

Python

5 min read

The ultimate reference for clean Pandas code
The ultimate reference for clean Pandas code
Python

5 min read


Published in

NLPlanet

·Pinned

Fine-tuning DistilBERT on senator tweets

A guide to fine-tuning DistilBERT on the tweets of American Senators with snscrape, SQLite, and Transformers (PyTorch) on Google Colab. Introduction Tweets are short bits of text that can (sometimes) be packed with valuable data. In the case of United States Senators, official Twitter accounts contain their opinions on a wide…

Machine Learning

10 min read

Fine-tuning DistilBERT on senator tweets
Fine-tuning DistilBERT on senator tweets
Machine Learning

10 min read


Published in

Towards Data Science

·May 2

PyCon gems: A Curated Selection of Exceptional Talks from PyCon DE 2023

LLMs in isolation are not the future. — The excitement and tension in the air were palpable as crowds lined up only to be turned away as conference rooms filled to capacity at PyCon DE 2023 in mid-April in Berlin. The release of ChatGPT mere months before set off an AI frenzy, sparking a tsunami of innovation and…

Data Science

8 min read

PyCon gems: A curated selection of exceptional talks from PyCon DE 2023
PyCon gems: A curated selection of exceptional talks from PyCon DE 2023
Data Science

8 min read

Mary Newhauser

Mary Newhauser

647 Followers

Senior Data Scientist at Wiley.

Following
  • ODSC - Open Data Science

    ODSC - Open Data Science

  • The PyCoach

    The PyCoach

  • Soner Yıldırım

    Soner Yıldırım

  • Fabio Chiusano

    Fabio Chiusano

  • John Adeojo

    John Adeojo

See all (47)

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech

Teams