Writing Samples

I've written a lot over the years. Top level stats: over 500,000 words, almost 1,400 posts, and 2,400 comments published internally at Automattic (excluding Slack, Basecamp, and Github). I cannot share all of these, but I will share some examples below.

Other than writing samples, I'd also like to emphasize that I've received a long list of kudos shared by my team members over the years. Kudos is our way of internally recognizing and appreciating coworkers in a public way.

Note: most of these are screenshots. You can enlarge them by clicking on them.

P2s (internal blogs at Automattic)

How I Work

This is a little outdated, but it highlights how I worked back when I did support, including workflows and the apps I use(d). Still, it's a good example of how I'd write a pretty general post that's not directly work-related:

How I Work: Daniel Danilov

I also document my own learning journey, but most of that is in my personal notes files. Only very few make it to a personal blog I keep, usually those that provide official credentials/certifications. For example, here are all the blog posts I published while learning SQL/Python for my data work at Automattic:

DataCamp Courses

Project: Scope of Support

Here's one example of a project thread I can safely share. It follows a similar communication process to that of 37signals, where updated on progress are expected, where division leads can follow the project thread and receives notifications for every update:

Data Analytics

Automation Script

We use a special way of writing SQL internally, which often means changing variables into absolute values when testing locally (as opposed to publishing the end result in Github). This can be very frustrating, and I used an Apple Script to solve this problem, and then shared it with the data division:

Automating the Removal of ${} from Table Names in DataGrip Using Automator and AppleScript

Template for structuring data reports

This is an example of writing documentation for a process that wasn't previously defined, but had to be to ensure consistency among data reports published by the various data teams:

New Deep Dive Analysis Scratchpad Available!

Learning Tumblr Analytics

This is an example of where I've gone through an in-depth SQL analysis on Tumblr, only to learn that the information is stored elsewhere. Tumblr has gone through 4 acquisitions, and the data is quite messy. Thankfully, someone knew where to look, and it gave me an opportunity to complete the analysis accurately (while still keeping the wrong approach so it doesn't happen again).

Custom Themes on the Tumblr Blog Network

Explaining Retention Rates in A/B Tests

After receiving a data request, we realized that the team is struggling with understanding retention rates. This is a follow up post that I've shared, using the specific experiment as an example, to help explain the process to everyone else in Tumblr.

Understanding Retention Rates in A/B Tests

Exploring Measuring `timespent` Metric

After working on various analyses, I noticed that the way we define timespent in Tumblr is not consistent. This has turned into its own task that I've tackled to help clarify this confusion for everyone. It required digging through a lot of code (primarily Python with embedded SQL) to isolate the exact definitions.

Exploring how we measure timespent

Auditing Data Tables

Finding the right tables has become very tricky, since a lot of the tables had similar names, and were impossible to find the correct one.
Diving deeper into this resulted in clearing 17 terabytes of unnecessary data.

Data Tables Auditing Summary

Replacing all KPI Charts with Superset & Looker

This was a pretty big project I've worked on, reviewing all of Tumblr's KPI charts and replacing them with more modern tools, specifically Superset and Looker (or both). Behind the scenes, this required rebuilding the dashboards in both services, using LookML and custom SQL queries, and making sure the data pipeline is updated via Airflow and updated to our Python code.

Replacing all KPI charts with Superset and Looker; deprecating Vizzie

DAG Optimization and ETL Failure resolution

This is a summary of an issue that was quite long lasting (almost 4 months) and required the full support of Tumblr Analytics as well as our Core Engineering team to tackle. This post aimed to capture the process and the learnings along the way.

User Summary DAG Optimization and ETL Failure Resolved

Understanding A8C UDFs in Spark, Hive, and Trino

Data at Automattic often uses various user-defined functions but these were not documented well, and therefore, made onboarding into the team very confusing. I've taken it upon myself to learn, understand, and document the various UDFs that existed to make it easier for everyone else. It's a good example of how I'd approach documentation.

Updated internal documentation: Understanding A8C UDFs in Spark, Hive, and Trino
Shared an announcement post to ensure everyone knows it exists now: New Field Guide: Understanding A8C UDFs

Public-facing posts

Data.blog

I've also authored a few posts on our public facing blog, data.blog. Two in particular are relevant:

From Support to Data Science and Analytics: My Journey at Automattic.
Summary of a meetup project we've completed. Hack Project: Creating a tool to translate customer feedback into product insights. This is still being used today by Tumblr's support and product teams.

DanielDanilov.com

I also published a couple on my own blog post, though, not nearly as often as I'd like to. For some reason, I find it easy to write for work, but much harder to write for myself. Still, there's one post in particular that I'm pretty happy with: Write to help yourself, publish to help others.

This one is also a good one, though, it's mostly a collection of questions I'd occasionally look at when preparing for 1-1s back when I was a team lead: Questions for 1-1s and teams – a primer for remote communication.

I've pitched Shape Up within Automattic (even though I wasn't a developer), which has since been picked up by multiple teams. I believe in this approach and recently even published about how Shape Up can be used for personal life, not just work. I wrote about this in this post: What If We Used Shape Up in Our Personal Lives?. It was more of a thought exercise at the time.

Now that I'm applying for this junior programmer role, I'm extra sad that I don't have more public resources to pull from!

WordCamp Asia

I've been an organizer in WordCamp Asia 2025. As part of my work, I published a couple of blog posts:

Why You Should Become a Micro Sponsor at WordCamp Asia 2025
Level Up Your Networking Experience at WordCamp Asia: as part of the work that went towards encouraging networking in WordCamps, and my position internally at Automattic, I was able to leverage my connection to the company to launch a new feature in Gravatar.
- This stared off with me pitching the idea to Gravatar team.
- Continued off to become a project thread. I was responsible for actually launching it during the event.
- I summarized the impact here.

Github

While I don't have many decent PRs to share from Open Source contributions, I will share some example PRs from work I've done internally at Automattic that do not reveal any personal information.

Writing Samples

P2s (internal blogs at Automattic)

How I Work

Project: Scope of Support

Data Analytics

Automation Script

Template for structuring data reports

Learning Tumblr Analytics

Explaining Retention Rates in A/B Tests

Exploring Measuring timespent Metric

Auditing Data Tables

Replacing all KPI Charts with Superset & Looker

DAG Optimization and ETL Failure resolution

Understanding A8C UDFs in Spark, Hive, and Trino

Public-facing posts

Data.blog

DanielDanilov.com

WordCamp Asia

Github

People Analytics

Add requester_waiting_time_intervals and _summary metrics for tumblr tickets

Add Zendesk Tumblr insights schema and SQL transformation

Add Zendesk Tumblr Insights Explore/View

Fix duplicates in office_name

Add GDPR and Country custom fields to zendesk tumblr tickets table

Add a8c_devex_trial_status_decided transformation

Tumblr

Add three new intermediate tables for dsa_user_summary DAG

Replacing subqueries with intermediate tables to improve query efficiency

Update syntax in intermediate tables for users summary dag

Debug spam backfill DAG in Tumblr

Add new metrics to stats dashboard

Add skip check flags and wait for DAG completion to dsa_spam_backfill, dsa_marketing, and dsa_messaging

Update kpi_review notebook to include Superset chart links to retire Vizzie links

Very simple PRs

Add new_churned_total_non_spam task

Fix the name of non-spam to spam_tumble_log_users table in dsa_user_summary

Add missing argument to dsa_spam_backfill.py

Add override=True for DROP TABLE statements

Exploring Measuring `timespent` Metric

Fix duplicates in `office_name`

Add `a8c_devex_trial_status_decided` transformation