Job Description:
At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being an inclusive workplace, attracting and developing exceptional talent, supporting our teammates' physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve.
Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations.
Want more jobs like this?
Get jobs delivered to your inbox every week.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!
Job Description:
This job is responsible for providing front-line support to end users, responding to issues related to incidents and problem management governance for multiple applications, and leading triage activities on all business impacting incidents. Key responsibilities include ensuring compliance with incident management and problem management policies and procedures, serving as a focal point for the customer, client, and associate experience, restoring complex production incidents under tight Service Level Agreements, and pursuing root cause and problem resolution follow ups.
Responsibilities:
- Leads production support triage efforts, manages bridge line troubleshooting, engages in technical research, and escalates issues to leadership as needed
- Ensures all impacts are accurately recorded and documented in the system of record, oversees that documents and wikis are updated and available for use during triage, and supports the documentation of application flows, upstream/downstream impacts during outages, the customer experience, and contacts for support needs
- Identifies and/or validates business impacts through interpretation of monitors, dashboards, and logs to communicate with leadership and vendors
- Manages activities to identify incident root cause, resolution, preventative actions, and change requests, and reports on incident data quality
- Promotes and enforces production governance during triage/testing and identifies production failure scenarios, vulnerabilities, and opportunities for improvement
- Serves as a subject matter expert for applications within a portfolio, leveraging extensive knowledge of application functionalities and application flows
- Assesses and prioritizes research requests, ad hoc reports, and offline incidents at the direction of senior team members and delegates work as needed to team members and peers
Required Qualifications
- 5+ years in technology experience.
- Hadoop System Administration skills with at least 3+ years of experience of handling Cloudera distribution of Hadoop or Horton works.
- Experience on evaluating the Proof of Concept on new Hadoop tools and related technologies. Formulate and design systems scope and objectives for applications and the development of information technology projects.
- Extensive experience in big data tools: Hadoop, Hive, Impala and Spark.
- Proficiency in Scala, Python, SQL, and PySpark. Strong experience and triaging skills on the hive, Spark data analysis & Impala.
- Experience with stream-processing systems: Kafka, Spark-Streaming, etc.
- Experience on configuring high availability of the Cloudera Hadoop services to achieve never down situation on Hadoop components.
- Good understanding of Operating System (OS) concepts, process management, capacity planning and resource scheduling.
- Basics of Infrastructure networking, Memory, Racks and storage to help building the clusters for new client onboarding.
- Proficiency in managing and communicating data warehouse plans to internal clients.
- Great knowledge on Resiliency management for data and metadata on the Hadoop cluster via tools like Back-up & Disaster Recovery (BDR) & any other real time replication mechanism.
- Experience with building processes supporting data transformation, data structures, metadata, dependency, and workload management.
- Strong analytic skills related to working with structured & unstructured dataset. A successful history of manipulating, processing, and extracting value from large, disconnected datasets.
Desired Qualifications
- Self-Motivated; ability to multi-task efficiently.
- Strong communication skills, backed up with strong skills on data lake concepts.
- Ability to handle production triages coordinating with all the required parties like support groups, users, vendor etc.
- Must have idea on change management routines to implement solutions calculating the risks.
- Strong SQL skills on any RDBMS like Oracle, Teradata, MySQL
- Job Scheduling tools : Autosys.
- Business Intelligence (BI) tools : Tableau, Statistical Analysis System (SAS)
Skills:
- Adaptability
- Analytical Thinking
- Influence
- Production Support
- Risk Management
- Automation
- Collaboration
- Innovative Thinking
- Result Orientation
- Solution Design
- Business Acumen
- DevOps Practices
- Project Management
- Solution Delivery Process
- Stakeholder Management
Shift:
1st shift (United States of America)
Hours Per Week:
40