Welcome to the Data-Centric AI Community

Intrigued by the possibilities of Data-Centric AI and want more?

Join like-minded experts, thought leaders and peers at the Data-Centric AI Community!

Join us on Slack
Join us on GitHub

Want to understand Data-Centric AI better?

Data-Centric AI is the process of building and testing AI systems by focusing on data-centric operations (e.g. cleaning, cleansing, pre-processing, balancing, augmentation) rather than model-centric operations (e.g. hyperparameter selection, architectural changes).

The term was coined by Andrew Ng. The definition was provided by Marco Postiglione and chosen by the community itself.

The Data-Centric AI Community is the place to discuss data quality for data science.

Data Profiling

Understanding the existing data is the first step. Profile your data in a few lines of code.

Explore your data with pandas-profiling!
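
To make "profile your data" concrete, here is a minimal sketch of the kind of per-column statistics a profiling report typically contains (missing values, distinct values, numeric range). It uses only the Python standard library and is not the pandas-profiling API; the helper names `profile_column` and `profile_table` and the sample rows are made up for illustration.

```python
import math
from statistics import mean


def is_missing(value):
    """Treat None and float NaN as missing."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def profile_column(values):
    """Summarize one column: size, missing count, distinct count, numeric range."""
    present = [v for v in values if not is_missing(v)]
    summary = {
        "count": len(values),
        "missing": len(values) - len(present),
        "distinct": len(set(present)),
    }
    # Add numeric statistics only when every present value is a number.
    numeric = [
        v for v in present
        if isinstance(v, (int, float)) and not isinstance(v, bool)
    ]
    if numeric and len(numeric) == len(present):
        summary.update(min=min(numeric), max=max(numeric), mean=mean(numeric))
    return summary


def profile_table(rows):
    """Profile a table given as a list of dicts (hypothetical input format)."""
    names = []
    for row in rows:
        for name in row:
            if name not in names:
                names.append(name)
    return {name: profile_column([row.get(name) for row in rows]) for name in names}


# Illustrative data, not taken from any real dataset.
rows = [
    {"age": 34, "city": "Porto"},
    {"age": None, "city": "Lisbon"},
    {"age": 29, "city": "Porto"},
]
report = profile_table(rows)
for column, stats in report.items():
    print(column, stats)
```

A dedicated profiling library goes much further (distributions, correlations, warnings), but the idea is the same: summarize every column before modeling.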
Synthetic Data

Synthetic data is artificially created data that keeps the statistical properties of the original data, preserving its business value while remaining privacy-compliant.

Expand your data with ydata-synthetic!
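
As a toy illustration of "keeping the original data properties", the sketch below fits a mean and standard deviation to one numeric column and samples new values from a normal distribution with those parameters. This is an assumption-laden simplification: it is not how ydata-synthetic works, it handles a single column only, and it offers no privacy guarantee. The function name `synthesize_gaussian` and the sample values are hypothetical.

```python
import random
from statistics import mean, stdev


def synthesize_gaussian(values, n_samples, seed=0):
    """Sample synthetic values from a normal distribution fitted to `values`."""
    mu = mean(values)
    sigma = stdev(values)
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    return [rng.gauss(mu, sigma) for _ in range(n_samples)]


# Illustrative "original" column, not taken from any real dataset.
original = [48.0, 52.5, 61.0, 39.5, 55.0, 44.0, 50.0, 58.5]
synthetic = synthesize_gaussian(original, n_samples=10_000)

# The synthetic column should roughly reproduce the original summary statistics.
print("original  mean/stdev:", mean(original), stdev(original))
print("synthetic mean/stdev:", mean(synthetic), stdev(synthetic))
```

Comparing summary statistics like this is the simplest fidelity check; realistic synthesizers also need to preserve relationships between columns.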
Data Labeling

Isn’t data labeling one of your biggest pain points in data quality? The DCAI Community cultivates meaningful discussions around this and other topics!

Coming soon

Want to learn more?

Introducing the Data-Centric AI Community

A place to discuss data quality for data science.

From model-centric to data-centric

A new paradigm for AI development — focused on data quality.

Pandas Profiling for Quicker Data Understanding

Read your data? Pause. Generate the Pandas Profiling report first.

Synthetic Time-Series Data: A GAN approach

Generate synthetic sequential data with TimeGAN.

How to Validate the Quality of Your Synthetic Data

A tutorial on combining ydata-synthetic with Great Expectations.

Versioning and Labeling — Better Together

Data labeling and data versioning provide a rock-solid foundation for building your machine learning models, now and in the future.

Stay updated with our Newsletter

Sign up for The Gaussip, our weekly newsletter, for nuggets of information on the hottest topics, exciting advancements in AI, and funny memes. Stay in touch!