Hello

I'm Amar Chaudhari

Infrastructure Enthusiastic

  • ADDRESS 3700 Casa Verde St, San Jose, CA
  • EMAIL contact@amarchaudhari
  • CURRENT STATUS Open to Opportunities

Download Resume

I am a passionate SRE with experience in large-scale production systems, automation first mindset, and well versed in numerous technologies including Linux, Switching/Routing, and DevOps

Professional Skills

Linux (CentOS-6/7) 90
Shell Scripting 80
Python 90
Switching 60
Routing (RIP/OSPF/BGP) 60
VXLAN 50
Juniper Firewalls(SSG240,SRX3600) 70
Cisco Switches/Routers 60
Django 70

Education

2016 - 2018

Master of Telecommunication (Network Engineering)

University of Colorado Boulder

2009 - 2013

Bachelor Information Technology

Pune Institute Of Computer Technology

Work Experience

2018 - Present

Sr. Site Reliability Engineer

LinkedIn

- Responsible for reliability, resiliency, and performance of over 30 million Company Pages, Products Marketplaces and Company Search on LinkedIn.
- Lead the production readiness of Company Pages services including performance tuning, JVM heap tuning, migrating/tuning G1 garbage collector that resulted in a 40% reduction in GC time and handles 350,000 qps of site traffic.
- Lead the cluster capacity planning for the Company Pages services.
- Identified SLI’s and established SLO’s & SLA’s for monitoring & alerting.
- Lead a team of engineers to strategize, design, and develop a tool to automate availability triaging that reduced toil by 50% for the Product SRE.
- Authored technical sessions for the development team to improve reliability and resilience in the services.
- Lead post mortems for Company Pages and worked with development teams to drive resolution of the action items.
- Worked with TPMs & Managers to finalize quarterly OKRs and sprint planning for the team.
- Mentored new team members to effectively onboard to the LinkedIn stack and Company Pages services.
- Participated in a 24x7 on-call rotation for production services.

2017 - 2018

Graduate Student Assistant - Network Operations

University of Colorado Boulder

- Working with the NOC of the University Of Colorado
Boulder to build and maintain campus network.
- Setup Virtualization Environment using ESXi & KVM and
AOE SAN Datastore.
- Setup 3 Node Proxmox cluster with CoRaid AOE SAN
Datastore.
- Created VMs and Setup tools - Arista CVP, Rancid,
Smokeping, Zabbix, sFlowtrend, Observium etc.
- Setup a POC environment for VXLAN testing.

2017 - 2017

Site Reliability Engineering Intern

LinkedIn

- Worked with the GRID SRE team which is responsible for
all the Hadoop clusters in LinkedIn.
- Designed and built a tool to fingerprint error logs from
10,000 Hadoop nodes in near real-time.
-The tool consumed on an average 50,000 logs/minute
and provided visual representation of the cluster state.

2013 - 2016

Network Engineer

Rakuten Inc.

- Worked with the Network Development Team to design,
plan and deploy datacenter networks.
- Configured Routing Policies (Static/OSPF/BGP), ACLs and
VPN tunnels.
- Configured Cisco Nexus/Router 3650, 5548, 5596, 7k,
ASR.
- Configured Juniper Firewall SSG2000, SRX3400.
- Configured F5 BIG-IP i5000, i2000 Load Balancers.
- Configured Monitoring Tools Such As MRTG, PRTG,
SolarWinds.
- Lead the Automation Team which focused on automating
mundane and time consuming tasks such as adding
VLAN’s to 1000 access switches, configuring ACL’s, device
configuration backups etc.

My Interests

Site Reliability Engineering Infrastructure Automation Distributed Systems Micro Services

Comments (0)

Bitnami