Job Description Systems Engineer incorporates aspects of software engineering and applies that to operations with a goal of creating ultra-scalable and highly reliable systems.
Is your CV ready If so, and you are confident this is the role for you, make sure to apply asap.
Involved in all levels of design, development and delivery of these applications and infrastructure.
Helps define the evolving standards and processes that facilitate an efficient and reliable environment, implementing immutable infrastructures within internal data centers and external cloud providers.
Manages software changes and releases and be responsible for implementation.
Responsible for providing 24x7 application and technical support to all production and test environments.
Application release and configuration management.
Pinpoint areas of underdeveloped technology, software practices, and operational procedures that pose a risk to the production environment; actively develop technological solutions and business processes to reduce this risk.
Responsibilities Engineering Responsible for the Automation of Operations & Support for Trading / Business Systems & Applications Assume a leadership role in the continuous improvement of our Software Deployment Automation (SDA) Research and select products to fulfill system reliability engineering requirements (ex.
Tidal, Puppet, Mesos, Git, etc.) Investigate, identify and fix application performance, systemic inadequacies and latent reliability issues Troubleshoot stack-wide engineering issues related to hardware, software, network and application Implement Operational Automation for Monitoring, Managing, Deploying and Validating of Systems / Applications Mentor engineers and team members on methodology, standards and best practices Communicate enhancements through internal presentations, blog articles and documentation Change Management Manage installation and configuration of application software leveraging SDA Manage, schedule and track application changes and releases Define, test, and document installation and fallback of system changes Asses, manage and communicate risks prior to implementation Document specific procedures for implementation and error checking Implementation Management Design implementation strategy and procedures Deploy application software and configuration changes across all Production and Test environments utilizing SDA and our version control system Ensure proper system and application health checks are performed and run successfully following implementation Reliability Testing Create and execute operational test plans / scripts / cases based on technical and business requirements Support the execution of acceptance, performance, capacity, regression and system failover testing on back-end components using automated test procedures wherever possible Support scheduled weekend testing of Production level systems and applications Application Support Provide 2nd level application support for Test and Production environments Troubleshoot and diagnose systems problems and make repairs in areas of responsibility Utilize Jira, Github and ServiceNow to track and manage defects discovered during testing and application support Troubleshoot and diagnose systems problems; install and configure fixes for applications Provide online and offline Level II support for the trading systems (24 x 7 on call) Analyze cause of production problems and develop methods for improvement Define, execute and implement Systems Reliability Standards and Processes Evaluate, design and implement Automation products and initiatives that lead to higher systems / application reliability Define, execute and implement Change Control and Implementation Management for Application functions Define, execute and analyze Operational Acceptance Test initiatives Provide Level II Application Support Requirements Bachelors degree or equivalent knowledge and experience Expert knowledge of Linux servers, specifically RHEL Demonstrable knowledge of TCP / IP Experience of high-level scripting language (Python, etc.) and software design and life cycle Practical knowledge and use of source control systems, Preferably GIT Experience with IT Automation Software (Puppet, Chef, SaltStack) Knowledge of basic network administration Operational testing, change control and implementation management experience Practical experience with monitoring and alerting workflows and technologies General understanding of capacity and performance management Possess the ability to adapt and adjust to rapidly changing land-scape and priorities Strong ability to work independently and prioritize tasks with little or no direction Passionate lifelong learner and innovator that understands that current technologies and problems will continue to change Familiarity with open source software development culture, community and workflows EEOC : 40HRS Staffing is an Equal Opportunity Employer and Prohibits Discrimination and Harassment of Any Kind : 40HRS Staffing is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment.
All employment decisions at 40HRS Staffing are based on business needs, job requirements, and individual qualifications, without regard to race, color, religion or belief, national, social or ethnic origin, sex (including pregnancy), age, physical, mental or sensory disability, HIV status, sexual orientation, gender identity and / or expression, marital, civil union or domestic partnership status, past or present military service, family medical history or genetic information, family or parental status, or any other status protected by the laws or regulations in the locations where we operate.
40HRS Staffing will not tolerate discrimination or harassment based on these characteristics.
System Engineer • New York, New York, US