Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Technology Lead - Spark Scala Developer

Yesterday Mississauga, Canada

Infosys is seeking an experienced Spark Scala Developer to design, develop, and optimize scalable bigdata solutions. The candidate will work on building high-performance batch and real-time data pipelines leveraging the Hadoop ecosystem and distributed computing frameworks(Spark). The role involves working closely with data engineers, architects, and business stakeholders to deliver robust, scalable, and efficient data processing systems.

Required Qualifications:

  • Candidates authorized to work for any employer in Canada without employer-based visa sponsorship are welcome to apply. Infosys is unable to provide immigration sponsorship for this role at this time.
  • Candidate must be located within commuting distance of Mississauga, Ontario or be willing to relocate to the areas.
  • Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
  • At least 4 years of Information Technology experience
  • 4+ years of experience in Big Data technologies.
  • Strong expertise in:
    • Apache Spark (Core, SQL, DataFrames, RDDs)
    • Scala programming
    • PySpark
  • Hands-on experience with:
    • Kafka (real-time streaming)
    • Hadoop ecosystem (HDFS, Hive, Impala)
    • NoSQL Databases (HBase, MongoDB, Couchbase)
  • Strong understanding of distributed computing concepts and data processing frameworks.
  • Experience in building ETL/data pipelines for large-scale datasets.
  • Proficiency in SQL and data modeling.
Preferred Qualifications:

Want more jobs like this?

Get Data and Analytics jobs in Mississauga, Canada delivered to your inbox every week.

Job alert subscription
  • Hands-on experience with data lakes, data warehouses, and scalable ETL pipeline design, including batch and real-time processing architecture.
  • Strong understanding and practical exposure to Agile software development methodologies (Scrum) and SDLC practices.
  • Proven experience in Banking domain, supporting use cases such as fraud detection, risk analytics, regulatory reporting, and customer insights.
  • Excellent analytical, problem-solving, and communication skills, with the ability to translate business requirements into scalable technical solutions.
  • Demonstrated ability to work effectively in cross-functional, multi-stakeholder environments, collaborating with Business, Data Engineering, and Architecture teams.
  • Experience with real-time data streaming frameworks such as Kafka and Spark Streaming for low-latency processing.
  • Understanding data modeling concepts (dimensional modeling, snowflake schemas) to support analytics workloads.
  • Experience and desire to work in a global delivery environment.
Key Responsibilities:
  • Design and develop large-scale data processing pipelines using Apache Spark (Scala & PySpark)
  • Build and optimize batch and real-time data processing workflows using Spark, Kafka, and Hadoop ecosystem
  • Develop Spark applications using RDDs, DataFrames, and Spark SQL for complex transformations
  • Develop and optimize PySpark applications leveraging joins, Spark DAG execution flow, stage optimization, transformation techniques, and streaming with dynamic allocation and failover handling.
  • Implement streaming pipelines using Kafka and Spark Streaming / Structured Streaming
  • Develop and maintain HDFS, Hive, NoSql and Impala-based data lake solutions
  • Convert existing SQL/Hive workloads into optimized Spark jobs for improved performance
  • Work with ETL pipelines to ingest, cleanse, transform, and process large datasets
  • Optimize performance through partitioning, caching, serialization, and tuning techniques
  • Handle data formats such as Parquet, ORC, Avro, JSON
  • Integrate multiple data sources including streaming systems, flat files RDBMS, and APIs
  • Collaborate with cross-functional teams to understand business requirements and translate them into scalable technical solutions
  • Ensure data quality, reliability, and performance monitoring across pipelines
  • Participate in code reviews, design discussions, and best practices implementation
Key Skills:
  • Distributed Data Processing.
  • Spark Optimization & Performance Tuning.
  • Real-time Data Streaming.
  • Data Modeling & ETL Design.
  • Problem-solving and Analytical Thinking.
  • Strong Communication & Stakeholder Management.
Nice to Have:
  • Exposure to Machine Learning pipelines or MLOps workflows.
  • Experience with Databricks platform.
  • Experience with AWS/GCP.
Summary:
This role requires a highly skilled Spark Scala Developer with strong expertise in big data engineering, streaming systems, and distributed computation, capable of building scalable, high-performance data platforms supporting enterprise analytics.

Estimated annual compensation range for the candidate based in the below location will be:
Ontario: $ 92740 to $ 123375

The job entails sitting as well as working at a computer for extended periods of time. Should be able to communicate by telephone, email or face to face. Travel may be required as per the job requirements

Client-provided location(s): Mississauga, Canada
Job ID: Infosys-149066BR
Employment Type: OTHER
Posted: 2026-06-09T18:45:53

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Life Insurance
    • HSA
    • Short-Term Disability
  • Parental Benefits

    • Birth Parent or Maternity Leave
    • Non-Birth Parent or Paternity Leave
    • On-site/Nearby Childcare
  • Work Flexibility

    • Office Life and Perks

      • Commuter Benefits Program
    • Vacation and Time Off

      • Paid Vacation
      • Paid Holidays
      • Personal/Sick Days
      • Sabbatical
    • Financial and Retirement

      • 401(K)
      • Relocation Assistance
    • Professional Development

      • Learning and Development Stipend
    • Diversity and Inclusion

      • Employee Resource Groups (ERG)