Overview
Position Title : Senior Big Data Engineer (Python / PySpark)
Locations / Flex : Pittsburgh PA - Two PNC Plaza or Cleveland OH - Strongsville Technology Center. The hiring team is not sourcing outside these hubs at this time. Working arrangement is Hybrid : 3 days in office, 2 remote. Acceptable time zones : EST. Working days : Mon–Fri, 40 hours. Working hours (flexible) : 8 : 00 a.m. – 5 : 00 p.m. EST.
Responsibilities
- Interact with the business to understand evolving requirements and adapt accordingly.
- Understand business requirements and tech stack involved; guide the team toward an integral solution approach.
- Demonstrate dedication and strong communication skills; participate in Agile ceremonies.
- Hands-on experience with object-oriented programming, including its limitations.
- Familiarity with cross-platform development.
- Manage database storage and retrieval on data lake platforms; extensive experience in handling.
- Extensive Unix / FTP / file handling.
- Strong hands-on experience with SQL databases.
Must Have Technical Skills
Deep understanding of multi-process architecture and the threading limitations of Python.PySpark data engineering skills.Experience using libraries like Pandas, NumPy and MatPlotLib; experience with file-based systems (CSV / Parquet).Extensive knowledge in understanding data and creating data pipelines.Hands-on experience designing and implementing APIs.Hands-on experience building microservices using FastAPI or related technologies.Experience writing unit tests and ensuring code coverage.Experience with Agile methodology / JIRA / Confluence.Experience in version control using Git.Flex Skills / Nice to Have
Added advantage if you have worked on Big Data, PySpark libraries.Education / Certifications
Bachelors Preferred
Python or Spark certifications are a plus
Role Differentiator
Conversion possibility and growth opportunities are strong, with ongoing work and energizing new opportunities. The role supports the bank in a critical area and contributes to overall success.
Skills
Deep understanding of multi-process architecture and Python threading limitations.Experience in version control using Git.Experience with Pandas, NumPy and MatPlotLib; file-based systems (CSV / Parquet).Experience with Agile methodology / JIRA / Confluence.Experience writing unit tests and ensuring code coverage.Understanding data and creating data pipelines.Hands-on experience building microservices using FastAPI or related technologies.PySpark data engineering skills.How to Apply
Share your resume with Ariz Khan at ariz.khan@systemone.com. Also connect on LinkedIn : Ariz J. Khan.
Ref : #404-IT Pittsburgh
J-18808-Ljbffr