Human in the loop data processing

Wednesday, December 9, 12:30pm - 12:55pm (PST)


What do you do when data is too messy to be useful, but too large to clean manually? In this talk, Bladey will share their tips for implementing 'human in the loop' data processing, an approach that focuses manual effort on the messiest data. When their team adopted this approach, a data cleaning task that used to take two months was cut to two weeks.


Join the chat in the #coalesce-human-in-the-loop-data channel. If you're not yet a member of the dbt Slack, sign up here.


https://www.crowdcast.io/e/coalesce2020/20