Performance Monitoring Engineer

Apple

In this highly visible role, you will have the responsibility of ensuring that Apple’s world class Silicon Engineering Group will have the infrastructure and tools needed to engineer and design the worlds most advanced silicon devices and products. You will utilize your deep understanding of developing and maintaining monitoring, alerting, and metrics gathering/analysis systems for diverse applications and systems. You will utilize your extensive communication skills to interface with internal teams, enabling Apple’s world-class product development.

Key Qualifications

  • Typically requires at least 5+ years of experience in a large scale environment (larger than 5000 devices). Finance, Oil & Gas, Scientific Research, or R&D environments preferred.
  • Experience monitoring a large scale distributed computing environment
  • Proficiency in Ruby, Python, Go, or other common high level scripting/programming languages is a must
  • Experience with monitoring tools (Nagios, Sensu, SolarWinds, DataDog)
  • Experience with deploying/developing dashboards (Grafana/Kibana/Dashing)
  • Experience with scaling monitoring solutions over multiple datacenters
  • Experience with aggregation and visualization such as the Elastic stack (ELK) or Splunk
  • Experience with time series databases (Graphite/InfluxDB/OpenTSDB)
  • Experience with configuration management tools (Puppet preferred)
  • Linux system administration experience (RedHat/CentOS preferred)
  • Understanding of revision control systems (Git experience desired)
  • Experience with sFlow and SNMP
  • Experience with Linux containers
  • Familiarity with stream processing systems like Kafka, Spark, or Storm for large-scale data analytics
  • Understanding of NFS and NAS appliances (NetApp preferred)
  • Understanding of Layer 2 / Layer 3 networking
  • Understanding of LDAP
  • Must be analytical and possess strong organizational / problem-solving skills.

Description

Your main responsibility will be supporting internal teams by developing, deploying, and maintaining tooling that will aid in the monitoring ecosystem. Your role will directly impact the development, enhancement, and maintenance of compute clusters, storage systems, network interconnects, and internally developed applications.

Education

MS/BS Degree or equivalent.