DevOps Engineer
DELL, Remote
Feb 2022 - Present
- Instituted Git branching strategies like GitFlow to streamline development, ensuring cleaner code
integration and easier release management.
- Spearheaded the Dockerization initiative, migrating legacy applications to Docker containers, reducing
infrastructure overhead by 30%.
- Optimized Docker images by implementing multi-stage builds, reducing image sizes by 40% and enhancing
deployment speed.
- Developed custom Docker Compose configurations, orchestrating multi-container applications for seamless
deployment and testing.
- Automated Kubernetes deployment on on-premise clusters using Ansible, improving deployment speed by 40%.
- Involved in the development of dev/test/prod environment on Docker containers using Kubernetes and
responsible for creating, managing, and troubleshooting pods and cron jobs using Kubernetes.
- Orchestrated and managed multiple Kubernetes clusters using Kubefed, ensuring efficient cluster
deployment and management.
- Executed Role-Based Access Control (RBAC) strategies, precisely defining user permissions within
Kubernetes clusters, ensuring least privilege access.
- Integrated Kubernetes Secrets and ConfigMaps for sensitive data management, encrypting and securely
managing configuration details and credentials.
- Implemented Kubernetes PersistentVolumeClaims (PVCs) and PersistentVolumes (PVs) to dynamically
provision NFS-backed storage for applications, ensuring data persistence.
- Experienced in using Ansible as a configuration management tool and developed Ansible Playbooks to
provision Apache Web servers, Tomcat servers, Nginx, and other applications.
- Implemented Velero as a robust backup and restore solution for Kubernetes clusters, ensuring data
protection and disaster recovery.
- Implemented Service Mesh (e.g., Istio) for enhanced security, facilitating encryption, authentication,
and authorization between microservices within the cluster.
- Led the deployment and management of ML pipelines using Kubeflow, enhancing data processing efficiency
by leveraging Kubernetes.
- Spearheaded the development of an AI-based document search chatbot using LLM and Gradio, streamlining
document retrieval processes.
- Utilized Elastic Search for indexing data, optimizing data retrieval for semantic searches.
- Used Jenkins pipelines to drive all microservices builds out to the Docker registry and then deployed to
Kubernetes, which improved performance by 25%.
- Contributed significantly to the deployment of CNVrg and Omnia, supporting high-performance cluster
deployments for AI and analytics workloads.
- Implemented Prometheus and Grafana for robust Kubernetes and infrastructure monitoring, enabling
proactive issue identification.
- Developed Python scripts to automate the build and deployment process for deploying the web services and
created Bash, shell, and python scripts for various Systems Administration tasks to automate repeated
processes.
|
DevOps Engineer
Torch.AI, Remote
July 2021 - Feb 2022
- Works closely with the technical and support teams to deliver and manage services with high uptime.
- Advocate for automated processes to improve quality, deployment, and support of multiple products.
- Build and maintain solutions and tools for automated builds, deployment, monitoring, and operations
Troubleshoot, perform root cause analysis, and implement fixes for deployment and infrastructure issues
- Builds new environments and updates existing environments through the use of automation.
- Maintain cloud infrastructure for multiple products and services.
- Worked on different queuing systems like rabbitmq, Kafka, sqs, etc.
- Worked on building/managing a cloud-based infrastructure in AWS and Google Cloud.
- Worked on building/managing Kubernetes environments.
- Managed Kubernetes charts using Helm. Created reproducible builds of the Kubernetes applications,
managed Kubernetes manifest files, and managed releases of Helm packages.
- managing continuous integration and deployment pipelines using Azure DevOps
- Setup of ELK (Elasticsearch, Fluentd & Kibana) on Kubernetes.
- Created different variations of the Kibana dashboard running different instances of Elasticsearch,
Logstash, and Kibana. Responsible for planning index and shards and index TTL strategies in Elasticsearch.
Troubleshooting Elasticsearch errors.
- Worked on microservices frameworks like Spring Boot, Swagger.
- Virtualized the servers using Docker for the test environments and dev-environments needs, also
configuration automation using Docker containers.
- Implemented Microservices in a load balanced, highly available, fault tolerant Kubernetes
infrastructure.
- Create develop and test environments of different applications by provisioning Kubernetes clusters on
AWS using Docker and Terraform
- Performed all Linux operating system, disk management, and patch management configurations, on Linux
instances in AWS
- Created branches and managed the source code for various applications in GIT.
- Deploy the various services like RabbitMQ, Activiti, Redis, Elasticsearch, ArangoDB, Postgres, Spark,
and also data containers in containerized environments using Kubernetes.
- Configure the docker image with a certificate so that the normal deploy process automatically uses a
secure connection for ActivitiUI to run over HTTPS.
- Verified that the latest Arango version works with SSL and turned it on in all environments.
- Configured Cloud Watch and Datadog to monitor real-time granular metrics of all the AWS Services and
configured individual dashboards for each resource Agents.
|
Cloud/DevOps Engineer
SunPower Corp, Remote
Sept 2019 - July 2021
- AWS and designed, built, and deployed a multitude of applications utilizing almost all the AWS stack
focusing on high availability, fault tolerance, and auto-scaling with CloudWatch monitoring.
- Automated provisioning of infrastructure for building environments on AWS CloudFormation stacks from the
resources VPC, EC2, S3, RDS, Dynamo DB, IAM, EBS, Route53, SNS, SES, SQS, CloudWatch, Security Group, Auto
Scaling.
- Structured the terraform setup having S3 as backend and applied state lock to the bucket using Dynamo DB
and used the S3 bucket to store terraform state files that aided in making changes to the resources to the
environment being developed which decreased the inaccuracy of making changes to a minimum.
- A deployed application that is containerized using Docker onto a Kubernetes cluster which is managed by
Amazon Elastic Container Service (ECS) for Kubernetes (EKS – Elastic Kubernetes Service).
- Worked on installing virtual machines in Azure and used Azure PowerShell and Portals to manage Azure
subscriptions and services.
- Worked in Azure compute services, Azure WebApps, Azure Functions, Azure Data Factory, Azure Media, Azure
Networking, Azure Hybrid Integration, Azure Identity, and Access Management.
- Experience in integrating and implementing CI/CD automation using Jenkins with various tools like Maven
(Build tool), Git (Repository), SonarQube (static Code analysis), and Nexus (Artifact repository).
- Created Docker Swarm using Docker CLI to Orchestrate, Schedule and deploy the services to Swarm and
managed the Swarm behavior. Created Docker file for each microservice and also changed some of the Tomcat
configuration files which are required to deploy Java-based applications to the Docker container.
- Developed environments of different applications on AWS by provisioning EC2 instances using Docker,
Bash, and Terraform. Created Docker images using a Docker file, worked on Docker container snapshots,
removed images, and managed Docker volumes.
- Worked on Kubernetes to automate the deployment, scaling, and operations of application containers
across clusters of hosts.
- Worked on building and deploying the application code using Kubernetes CLI called kubectl, kubelet,
kubeadm, kubespray, and schedule the jobs using Kube scheduler. Managed K8s charts using Helm and created
reproducible builds of the Kubernetes applications.
- Worked on Chef and Ansible for the deployment of servers.
- Used Ansible Playbooks Continuous Delivery Pipeline. Deployed microservices, including provisioning AWS
environments using Ansible Playbooks.
- Developed Shell Scripts and Ruby for automation of the build and release process. Developed custom
solutions in C# and PowerShell to validate availability, consistency, and compliance with environments.
- Extensively worked with Jira and Slack for project management and internal team communication.
|
Linux/DevOps Engineer
Hilton Hotels Worldwide, Memphis, TN
Feb 2017 - Sept 2019
- Worked on migrating VMWARE VMs to AWS and worked on managing services like EC2, S3 Bucket, Route53, ELB,
EBS utilizing Puppet.
- Worked on migrating data centers to AWS, utilized CloudFormation Templets and Amazon S3 to migrate
various applications from on perm to AWS.
- Worked on automating and creating templates for deployment of internal applications to Dev, Test, and
Production environments including the automation of the release processes utilizing Puppet.
- Worked on creating Web applications utilizing Elastic Beanstalk integrated Docker.
- Worked on scaling and orchestrating containerized applications utilizing Kubernetes.
- Worked with Kubernetes deployments, Kubernetes volumes, Kubernetes, Kubernetes network policy, and
Kubectl.
- Automated repetitive and manual tasks by utilizing Shell Scripting.
- Installed, tested, upgraded, and troubleshot server issues and worked on building and configuring RedHat
Linux System.
- Worked on handling NFS share mode operations in ZFS Oracle Storage At share & project Level by
automation.
- Develop and maintain cooperative and collaborative relationships with networking, storage, software
development, and project management teams
- Monitor production systems, applications, and network performance through the use of various Network
Management and Application Monitoring Tools
- Develop tools/scripts to automate integration with other IT tools in support of accurate asset
management, cyber reporting capabilities and to manage licenses
- Work with software developers and Quality Assurance to seamlessly migrate applications from development
to QA and production
- Performing a wide variety of daily Control M management tasks as well as providing recommendations for
effective and efficient management of system
|
Linux System Admin
Hyla Mobile, Dallas, TX
Jan 2015 - Feb 2017
- Identify and drive continual improvement programs in the LINUX environment and improve service
efficiency.
- Worked with application deployments, upgrades, data migrations, and patches.
- Experience parsing event logs to identify and troubleshoot incidents.
- Troubleshoot virtual machine backup and restore failures as well as perform database migrations.
- Maintaining Linux environment and basic services such as DHCP server, squid proxy server, and mail
servers.
- Creating new technical solutions and strategies.
- Building, configuring, patching, upgrading, and troubleshooting physical and virtual servers.
- Provide detailed project status reporting as directed by management.
- Performs day-to-day maintenance and installation of the server, network equipment, virtual machines,
messaging, backups, and storage.
- Installing of ESXi hypervisor servers, vSphere clients, and VCenter Servers.
- Restoring data in the event of an emergency.
- Building and maintaining Domain servers.
- Troubleshooting and maintaining TCP/IP, Apache, SMTP, and DNS applications.
- Configuring and adding new services, as necessary.
- Upgrading and configuring system software that supports infrastructure applications.
- Applying OS patches and upgrading regularly.
|