What are the responsibilities and job description for the Site Reliability Engineer position at Charles Schwab Inc.?
Your Opportunity
At Schwab, you’re empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us “challenge the status quo” and transform the finance industry together.
Our team is responsible for creating an industry leading order management system to support Schwab’s trading environment. This role is for a Site Reliability Engineer to support the Order Management System on a highly performant and scalable compute environment. Some of your responsibilities include responding to alerts and escalations, managing incidents and issues, technical documentation, and independently supporting/configuring complex systems. Proficiency with the Java tools we utilize for automation, delivery, and the software development lifecycle will be critical to your success. In addition to your technical work, you would be collaborating with the Development, QA, and SRE teams to optimize the environments and tooling. These responsibilities will expose the associate to a greater support role and a larger impact on the direction and success of our systems.
This role will require the ability to regularly work in a Schwab headquarters location. Immigration sponsorship is not available for this role.
- High sense of urgency with the ability to calmly collaborate within the team and the wider Schwab technology organization
- Analytical, organizational, and problem solving/decision making skills
- Troubleshooting, researching, and resolving major defects, questionable functions, errors, and inconsistencies in systems functions, outputs, and content
- Quick response to system monitors, alerts, escalations and all outages
- Provide 24/7 on-call support on rotating schedule including specific designated systems
- Study software documentation to understand our application’s business and technical details to provide efficient support
- Analyze issues to recommend improvements and avoid recurring issues
- Work with peers to resolve complex application issues
- Provide support guidance and direction for other production support resources
- Monitor and maintain health requirements on servers and track app/system performance
- Acquire general skill sets required for supporting and restoring services for all management designated applications and system
- “Go-to” resource for peers and business stakeholders
- Communicate service disruptions to Management and stakeholders while urgently addressing the technical issues
- Recognize & implement opportunities for improved process or procedures
- Provide excellent service to all Support clients and business partners
- Very detail oriented
- Proven ability to work effectively in a team setting as well as independently with minimal error and guidance
- Ability to draft a plan, socialize for feedback/new input, establish a roadmap & execute providing updates to stakeholders on an established cadence
What you have
To ensure that we have fulfilled our promise of "challenging the status quo," this role has specific qualifications that successful candidates should have.
Required Qualifications
- 4 year college degree or equivalent experience
- 5 years of experience supporting Java based applications in a RHEL environment
- Excellent communication skill set to positively impact application teams and work with stakeholders
- 2 years of Google Cloud Platform support or other cloud platform
- Experience troubleshooting/diagnosing JVM issues e.g. thread dumps, garbage collection and memory management (Java 17 and up preferable)
- Working knowledge of Git VCS a plus
- Experience working with Schwab systems and processes
- Version Control System Management using tools like Bitbucket, Git, and Artifactory in a Java-based environment, 3 year’s experience
- Experience with code reviews
- 3 year’s experience documenting processes and procedures to transition knowledge to junior team members
- Knowledge managing the Linux OS (RHEL 6, 7 & 8 preferred) and use appropriate commands to perform tasks, monitor and optimize for best application performance
- Familiarity with System Automation and Configuration management, utilizing tools like; Puppet, Foreman, Terraform, Salt, Ansible, etc.
- 2 year’s experience enhancing and deploying automated processes to minimize human intervention.
- Knowledge of scripting languages (shell, perl, python, ruby, JSON, XML, YAML)
- Familiarity with Agile SDLC and experience providing input to project meetings and project planning
- Experience with large financial services corporations or similar industry
- Available on weekends or after hours as required
In addition to the salary range, this role is also eligible for bonus or incentive opportunities