Site
Reliability Engineer and Backend Software Developer
July 2019- Present Amsterdam, Netherlands
|
Site Reliability Engineer
- Own services written in Java and Perl that protect company's customer-facing services
from external attacks.
- Establish SLOs and associated monitoring and alerting
- Client library development and support.
- Write Post-Mortems and end-user documentation
- Participate in on-call schedule for services.
- Participate in company-wide Datacenter failover drills.
|
January 2017 - March 2019 San Francisco, USA
|
Site Reliability Engineer
- Developed automatic failover service written in Java
and running on Kubernetes, for replicated HBase cluster
pairs to increase availability and reduce need for human
response to instance failures.
- Led infrastructure migration on Amazon Web Services
(AWS) from Chef to Terraform and Salt, improving
scalability and ability for other engineers to launch and
maintain services.
- Developed Terraform and Salt code to provision environments in Google Cloud Platform (GCP).
- Developed tooling for capacity planning estimation to scale AWS resources economically.
- Contributed to kafka-assigner,
an open-source tool to balance Kafka topic partitions
across available brokers.
|
May 2016 - December 2016 Oakland, USA
|
Founder
Created second language learning web application using
Clojure and PostgreSQL to help students learn Italian,
Spanish, and French. Worked closely with a second-language
educator currently using the system in his classes. Released
parts of the source code as an open source library
on Github.
|
June 2013-April 2016 Cupertino, USA
|
Siri Hadoop Operations Engineer
Member of group responsible for deployment, monitoring,
and diagnosing of Hadoop clusters running HBase, Solr,
Flume, Spark and Kafka. Participated in on-call rotation.
- Developed workstation-local distribution of HBase,
allowing developers to test work before cluster
deployment.
|
March 2010-May 2013 Cupertino, USA
|
Led Hadoop Group of 4 members, developed and
maintained internal Hadoop distribution. Developed
automated methods for upgrading and testing Hadoop
clusters.
- Led effort to upgrade internal Hadoop distribution from planning to deployment.
- Helped curate Apache source code for internal distribution.
- Security Enhancements to Apache Zookeeper: ZOOKEEPER-938
- Improvements to Apache Giraph: GIRAPH-168, GIRAPH-211
- Integration of Zookeeper Security enhancements with Apache HBase HBASE-2418
- Numerous minor improvements to Apache Hadoop, Apache Pig, Apache HBase, hadoop-lzo.
|