Leverage AWS design principles, data access patterns, and relevant AWS services to enhance the efficiency and performance of the data applications.
Monitor AWS resources to ensure availability and scalability.
Troubleshoot cloud-related infrastructure incidents and issues.
Perform deployments and upgrades in AWS.
Improve cloud product reliability, availability, maintainability, and cost/benefit-including developing fault-tolerant tools to ensure general robustness of cloud infrastructure.
Perform operations such as backup and restore.
Assist in ensuring security best practices for the cloud are followed and customer data is secured.
Stay up to date with cloud technology trends and best practices.
Creating and maintaining technical documents for cloud infrastructure and related processes
Have an advanced understanding of core networking and security standards and best practices.
Maintain regular communication with users regarding best practices, policies, procedures, and scheduled maintenance.
Documents and maintains work instructions, and completes audit-required tasks.
Requirements & Skills:
5 years of experience working with AWS services EC2, ECS, EKS, S3, SQS, Lambda, RDS, Athena, AWS Glue, Lakeformation, SNS, Load Balancers (ALB, ELB, NLB), IAM, VPC, Subnets, Cloudwatch and Route53.
Hands-on experience writing Terraform to provision and manage AWS cloud infrastructure.
Basic familiarity with network features, e.g., cloud network topology, routing
Configuration and provisioning of infrastructure for hosting third-party applications like Tableau, Cognos, etc.
Experience in writing scripts using Python, Ruby, YAML, or Shell scripts to automate tasks.
Experience handling patches, fixing vulnerabilities, AWS Cloud Watch event monitoring, and Cert update via AWS Cert Manager.
Experience in maintenance of Metadata repositories such as AWS Glue Catalog, and Glue crawler for data stored in: S3.
Experience with Story/Task/Bug tracking using tools with Jira.
Knowledge of SQL and relational/Columnar databases (Postgres, Redshift)
Good understanding of Authentication/Authorization concepts SSO, PING
Experience in Integration Services (Informatica Cloud, Airflow) is a plus.
Experience in cloud-based machine learning platforms (AWS Sagemaker) is a plus.
Experience in Data warehousing solutions (AWS Redshift) is a plus.
Experience in agile development processes and DevOps methodology/principles.
Experience in working within compliance (e.g.: quality, regulatory – data privacy, SOC) and cybersecurity requirements is a plus.