At Automattic, our deep appreciation for open source software is evident through our active contributions, with a primary focus on WordPress. Recently, we integrated Apache Superset, to help support our intricate data visualization needs. One notable achievement includes the automation of dataset creation, a solution that not only resolved an issue but also enhanced our data discoverability.
… Continue readingCategory: Data Engineering
The Data Docs: A WordPress based Data Discoverability Tool
The Data Docs is Automattic's new home made data discoverability solution. It collects and publishes our datasets' metadata to WordPress so it is accessible and searchable by our users. Here we describe the idea from its conception and explain how it works.
… Continue readingChallenge Hiring Assumptions with Data
We use experiments to model reality (sometimes to create alternative realities as in A/B experiments), to understand reality, and ultimately, to make decisions moving us ever closer to our goals. Improving iteratively, we learn not only from successful experiments but also from failed attempts. Experiments are important because they provide us with measurements. And measurements … Continue reading Challenge Hiring Assumptions with Data
SQL — a Common Language for the Whole Data Team
Today, I wanted to share how we’ve empowered colleagues outside the data engineering team to write their own data transformations by overcoming coding language barriers. Within our Data team, there are several specializations: Data Analysts create dashboards and analyze data for business leads and product teams across the company.Data Scientists apply machine learning technology at … Continue reading SQL — a Common Language for the Whole Data Team
Building Thousands of Reproducible ML Models with pipe, the Automattic Machine Learning Pipeline
Demet takes you deep into pipe, a tool that allows anyone at Automattic to build solid machine learning models.
… Continue reading