Site Reliability Engineer - SaaS Operations (Charles River); AVP

  • Location: Dublin, Leinster, Ireland
  • Salary: Competitive
  • Job Type: Full time

Site Reliability Engineer - SaaS Operations (Charles River); AVP

Charles River provides an ene-to-end solution to autmomate front and middle office investment management functions across asset clases on a single platform. Delivered as a hosted service, the solution improves data quality and investment professional productivity, controls risk and lowers technology costs. Charles River serves more than 350 investment firms over 40 countries in the institutional asset and fund management, private wealth, alternative investments, insurance, banking and pension markets. Charles River were acquired by State Street in October 2018.

Background: Charles River is rapidly growing it's Software-as-a-Service (SaaS) platform. We are looking to hire a hands on, well rounded solution designer for our expanding Infrastructure Support team. This team is responsible for the design, administration and daily operation of critical systems used to support our client's production, disaster recovery and test environments. The team also assists in the development of new support services and architects our current infrastructure.

Role: To design, implement and support SRE practices and technologies supporting scalability, performance and availability of Charles River's global SaaS platform. Drive transformation of monitoring tools and mentor and share knowledge with other members of the team. Collaborate with the SaaS Operations organization, Product and Engineering to enable a rapid feedback loop supporting quality and service delivery.
  • Monitor and report on the availability and performance of our SaaS platform.
  • Drive capacity planning through statistical analysis of log and monitoring data.
  • Design and deploy full stack application monitoring and log analysis platforms.
  • Support rapid and accurate root cause analysis through log and monitoring data analysis.
  • Automate provisioning and management of monitoring platforms, collectors, servers, databases and configuration data.
  • Assist in developing and tuning VMware design.
  • Enhance incident management practices.
  • Ensure interoperability with our configuration management architecture.
  • Utilise version control and test automation technologies to ensure reliability and availability of provisioning automation.
  • Participate in incident response and troubleshooting efforts as issues arise.
  • Participate in periodic weekend maintenance rotation.

Requirements:
  • Extensive experience in SaaS SRE, support, system administration, infrastructure management and automations.
  • Understanding of OSI and TCP/IP stacks.
  • Experience with full stack and cloud monitoring solutions such as Dynatrace, AppDynamics or Solarwinds.
  • Experience with log analysis platforms usch as Splunk, Elasticsearch, Logstash, Kiban, Sumo Logic.
  • Experience with Microsoft stack including Windows Server, SQL Server, Powershell and Active Directory.
  • Experience tuning and supporting production JRE Java server applications.
  • Excellent soft skills including interpersonal, verbal and written communication, customer service, teamwork, ability to multi-task, organizational skills, keen attention to detail, ability to deal with pressure.
  • Significant experience with VMware.
  • Familiarity with public cloud vendors platforms AWS and Azure.
  • A high degree of analytical, technical aptitude and troubleshooting skills.
  • Performance tuning Java server running on VMware.
  • Experience building and running NOC and SOC.
  • Familiarity with Ansible, Rundeck and Git.
  • Experience tuning Java server running on VMware.
  • Experience building and running NOC and SOC.
  • Experience with any major workload automation platform.
  • Financial services/trading system experience advantageous.

Qualifications:
  • University Degree or equivalent in a technical discipline.
  • Professional IT Certifications desirable.