Technology Lead - Spark Scala Developer

1 month ago• Mississauga, Canada

Infosys is seeking an experienced Spark Scala Developer to design, develop, and optimize scalable bigdata solutions. The candidate will work on building high-performance batch and real-time data pipelines leveraging the Hadoop ecosystem and distributed computing frameworks(Spark). The role involves working closely with data engineers, architects, and business stakeholders to deliver robust, scalable, and efficient data processing systems.

Required Qualifications:

Candidates authorized to work for any employer in Canada without employer-based visa sponsorship are welcome to apply. Infosys is unable to provide immigration sponsorship for this role at this time.
Candidate must be located within commuting distance of Mississauga, Ontario or be willing to relocate to the areas.
Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
At least 4 years of Information Technology experience
4+ years of experience in Big Data technologies.
Strong expertise in:

Want more jobs like this?

Get Data and Analytics jobs in Mississauga, Canada delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

Apache Spark (Core, SQL, DataFrames, RDDs)
Scala programming
PySpark

Hands-on experience with:

Kafka (real-time streaming)
Hadoop ecosystem (HDFS, Hive, Impala)
NoSQL Databases (HBase, MongoDB, Couchbase)

Strong understanding of distributed computing concepts and data processing frameworks.

Experience in building ETL/data pipelines for large-scale datasets.

Proficiency in SQL and data modeling.

Preferred Qualifications:

Hands-on experience with data lakes, data warehouses, and scalable ETL pipeline design, including batch and real-time processing architecture.
Strong understanding and practical exposure to Agile software development methodologies (Scrum) and SDLC practices.
Proven experience in Banking domain, supporting use cases such as fraud detection, risk analytics, regulatory reporting, and customer insights.
Excellent analytical, problem-solving, and communication skills, with the ability to translate business requirements into scalable technical solutions.
Demonstrated ability to work effectively in cross-functional, multi-stakeholder environments, collaborating with Business, Data Engineering, and Architecture teams.
Experience with real-time data streaming frameworks such as Kafka and Spark Streaming for low-latency processing.
Understanding data modeling concepts (dimensional modeling, snowflake schemas) to support analytics workloads.
Experience and desire to work in a global delivery environment.

Key Responsibilities:

Design and develop large-scale data processing pipelines using Apache Spark (Scala & PySpark)
Build and optimize batch and real-time data processing workflows using Spark, Kafka, and Hadoop ecosystem
Develop Spark applications using RDDs, DataFrames, and Spark SQL for complex transformations
Develop and optimize PySpark applications leveraging joins, Spark DAG execution flow, stage optimization, transformation techniques, and streaming with dynamic allocation and failover handling.
Implement streaming pipelines using Kafka and Spark Streaming / Structured Streaming
Develop and maintain HDFS, Hive, NoSql and Impala-based data lake solutions
Convert existing SQL/Hive workloads into optimized Spark jobs for improved performance
Work with ETL pipelines to ingest, cleanse, transform, and process large datasets
Optimize performance through partitioning, caching, serialization, and tuning techniques
Handle data formats such as Parquet, ORC, Avro, JSON
Integrate multiple data sources including streaming systems, flat files RDBMS, and APIs
Collaborate with cross-functional teams to understand business requirements and translate them into scalable technical solutions
Ensure data quality, reliability, and performance monitoring across pipelines
Participate in code reviews, design discussions, and best practices implementation

Key Skills:

Distributed Data Processing.
Spark Optimization & Performance Tuning.
Real-time Data Streaming.
Data Modeling & ETL Design.
Problem-solving and Analytical Thinking.
Strong Communication & Stakeholder Management.

Nice to Have:

Exposure to Machine Learning pipelines or MLOps workflows.
Experience with Databricks platform.
Experience with AWS/GCP.

Summary:
This role requires a highly skilled Spark Scala Developer with strong expertise in big data engineering, streaming systems, and distributed computation, capable of building scalable, high-performance data platforms supporting enterprise analytics.

Estimated annual compensation range for the candidate based in the below location will be:
Ontario: $ 92740 to $ 123375

The job entails sitting as well as working at a computer for extended periods of time. Should be able to communicate by telephone, email or face to face. Travel may be required as per the job requirements

Client-provided location(s): Mississauga, Canada

Job ID: Infosys-149066BR

Employment Type: OTHER

Posted: 2026-06-09T18:45:53

Perks and Benefits

Health and Wellness
- Health Insurance
- Life Insurance
- HSA
- Short-Term Disability
Parental Benefits
- Birth Parent or Maternity Leave
- Non-Birth Parent or Paternity Leave
- On-site/Nearby Childcare
Work Flexibility
Office Life and Perks
- Commuter Benefits Program
Vacation and Time Off
- Paid Vacation
- Paid Holidays
- Personal/Sick Days
- Sabbatical
Financial and Retirement
- 401(K)
- Relocation Assistance
Professional Development
- Learning and Development Stipend
Diversity and Inclusion
- Employee Resource Groups (ERG)

Want more jobs like this?

Perks and Benefits

Health and Wellness

Parental Benefits

Work Flexibility

Office Life and Perks

Vacation and Time Off

Financial and Retirement

Professional Development

Diversity and Inclusion