Hi Friend
Hope you are doing well
Number of position : 6
Only Full Time
I, Salman Shaikh would like to share a job opportunity as Big Data (PySpark) Tech Lead based in Irving, TX / Jacksonville, FL / Jersey City, N J (Onsite) location for a Fulltime position.
- In case, if you are not comfortable with this location, please share your preference with contact details for further requirements
Kindly find the JD below and let me know if you are available for the same.
Big Data (PySpark) Tech Lead
Job Location : Irving, TX / Jacksonville, FL / Jersey City, NJ(Onsite)
Duration : Full time
Big Data (PySpark) Tech Lead-
10 + Years Overall Experience in Data Management, Data Lake and Data Warehouse6+ Years Hadoop, Hive, Sqoop, SQL, Teradata6+ Years PySpark(Python and Spark), UnixGood to have Industry leading ETL experienceBanking Domain experienceKey Responsibilities
Ability to design, build and unit test applications on Spark framework on Python.Build PySpark based applications for both batch and streaming requirements, which will require in-depth knowledge on majority of Hadoop and NoSQL databases as well.Develop and execute data pipeline testing processes and validate business rules and policiesOptimize performance of the built Spark applications in Hadoop using configurations around Spark Context, Spark-SQL, Data Frame, and Pair RDD's.Optimize performance for data access requirements by choosing the appropriate native Hadoop file formats (Avro, Parquet, ORC etc) and compression codec respectively.Ability to design & build real-time applications using Apache Kafka & Spark StreamingBuild integrated solutions leveraging Unix shell scripting, RDBMS, Hive, HDFS File System, HDFS File Types, HDFS compression codec.Build data tokenization libraries and integrate with Hive & Spark for column-level obfuscationExperience in processing large amounts of structured and unstructured data, including integrating data from multiple sources.Create and maintain integration and regression testing framework on Jenkins integrated with BitBucket and / or GIT repositoriesParticipate in the agile development process, and document and communicate issues and bugs relative to data standards in scrum meetingsWork collaboratively with onsite and offshore team.Develop & review technical documentation for artifacts delivered.Ability to solve complex data-driven scenarios and triage towards defects and production issuesAbility to learn-unlearn-relearn concepts with an open and analytical mindsetParticipate in code release and production deployment.Challenge and inspire team members to achieve business results in a fast paced and quickly changing environmentPlease reply me with your updated resume and required details :
Full Name :
LinkedIn ID (Must To have as per exp)
Best number to reach you :
Work authorization / Visa Status :
Current Location :
Current Compensation :
Expected Compensation :
Best time to call you :
Waiting for your earliest response
Thanks & Regards
Salman Shaikh
1 781-896-2152 (Cell)Boston, MA