New
Software Engineer
![]() | |
![]() United States, Texas, Irving | |
![]() 7000 State Highway 161 (Show on map) | |
![]() | |
OverviewHave you ever imagined the world with an infinite amount of storage available and accessible to everyone? A place where everyone in the world can easily access their books/music/photos/video/any data from anywhere at any time via any means (e.g. mobile phones, tablets, PCs, smart devices, etc). Did you ever desire a universally accessible storage system to record all the knowledge known to mankind, to keep all the books/music/videos ever created, or to store all the data collected from all the scientists in the world for them to collaborate upon? Do you want to be part of a team that strives to bring these to reality?If so, the Microsoft Azure Storage team is what you are looking for. We are building Microsoft's cloud storage solution - Microsoft Azure Storage, which is a massively scalable, highly distributed, ubiquitously accessible storage system, designed to scale out and serve the entire world. We continue to have tremendous hockey stick growth, we have many Exabyte's of data stored, and are designing and building systems for Zettabyte scale to support demand growth for the coming years.We are looking for engineers who are passionate about distributed storage, more specifically in the areas of resource management of distributed systems across an entire geo-region! Candidates who want to work on a fast-paced team with talented engineers will thrive here. The Azure Storage Limitless and Cluster Resource Manager Team manages control and data plane operations that manage hundreds of thousands of servers at exabyte scale while serving hundreds of millions of requests per seconds at low latency. We provide the semantics to virtualize customer accounts and physical hardware across entire geo regions. We also develop and maintain infrastructure related to high performance transfer of customer accounts across storage scale units. Additionally, these areas present challenging technical problems in a space where innovation is always happening.We are working on storage control plane, resource management, cost of goods sold (COGS), and scale-related projects in XStore. The regional scale management team, also known as xLimitless, within XStore is an integrated and comprehensive resource management group responsible for smarter allocation of storage accounts and load balancing storage tenants across various resource dimensions such as central processing unit (CPU), memory, input/output operations per second (IOPS), and capacity. This is achieved by migrating storage accounts and virtualizing or scaling out storage accounts in the background, allowing the accounts to operate at scale with no limits.To be successful in these areas, you must thrive while solving challenges related to durability, availability and concurrency for a distributed system. You will have an opportunity to make high impact changes on a daily basis as you build a hyper scale storage system that may indirectly or directly be used daily by your friends and family.Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
ResponsibilitiesWorks with appropriate stakeholders to determine user requirements for a feature.Supports identification of dependencies, and the development of design documents for a product feature with oversight.Optimizes cost of goods sold (COGS) for Azure Storage, while enabling customers to scale out without limits on ingress, egress, input/output operations per second (IOPS), or capacity.Designs, implements, tests, and rolls out features that require you to think at zettabyte scale across tens of thousands of clusters worldwide -- these include distributed load balancing, performance tuning, and massively parallel control plane features to manage the exponentially growing storage fleet.Learns to create and implement code for a product, service, or feature reusing code as applicable, with guidance.Assists and learns about breaking down work items into tasks and provides estimation.Acts as a Designated Responsible Individual (DRI) in monitoring system/product feature/service for degradation, downtime, or interruptions for simple problems, and recommends actions to restore system/product/service by following the playbook.Reviews current developments and proactively seek new knowledge that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale. |