Sumit Kumar
Bellevue WA 98005 USA
Phone +1-425-XXX-XXXX
Email [@]

Education

  1. Integrated Masters in Maths & Computing, Indian Institute of Technology, Kharagpur (2000-05)

    Thesis title: An Intelligent Emergency Patient Monitoring System using distributed paradigms
    Advisors: Prof. A. K. Majumdar and Prof. J. Mukhopadhyay, Dept of Computer Sc. & Engg., IIT Kharagpur

Work Experience

  1. Principal Software Engineer, Microsoft Azure, Redmond (Aug 2019-present)

    Identified and fixed multiple bugs and design and process issues to improve cluster creation and operational reliability for long-running HDInsight clusters, improving the experience for high-revenue customers and reducing support burden.
    Led design and implementation of the core platform for providing big data services (Hadoop, Hive, ZooKeeper, etc.) on Azure Kubernetes Service.
  2. Senior Software Engineer (Big Data), Zillow Inc, Seattle (Jun 2019-Aug 2019)

    Created a POC implementing Bloom filter indices in an existing ETL pipeline, reducing the amount of data processed and demonstrating savings of more than $100k per year in Amazon EMR costs.
  3. Sr SDE at AWS Lambda, Seattle (Mar 2018-Jun 2019)

    Implemented support for 15-minute function duration (up from 5 minutes).
  4. Sr SDE at Amazon Aurora, Seattle (Jul 2016-Feb 2018)

    Created automation for patching OS libraries on tens of thousands of hosts in the storage fleet without affecting service uptime.
    Implemented handling for S3 LSE without degrading database performance.
  5. Sr SDE at AWS Elastic MapReduce, Seattle (Oct 2013-Jul 2016)

    Improved processing performance for Hadoop and Hive by optimizing listing of input data on S3 and renaming and deletion of intermediate outputs.
    Improved performance of "msck repair table" for EMR Hive 13 on S3 to 10x that of the standard Apache implementation, and 2x that of EMR's private feature "alter table recover partitions".
    Implemented EMR Kinesis connector for Hadoop, Hive, Pig and Cascading. Implemented open-source Kinesis Log4J Appender for easy ingestion of application logs into Kinesis.
    Implemented several changes to improve availability of EMR clusters. Implemented compression of logs that reduced the cost of logs and debugging by 10x.
  6. PMTS at iLabs (R&D), [24]7, Bangalore (Jul 2012-Sep 2013)
    Dev Manager at iLabs (R&D), [24]7, Bangalore (Sep 2011-Jul 2012)
    Tech Lead at iLabs (R&D), [24]7, Bangalore (Jul 2009-Sep 2011)

    Implemented a Scala-based agent for delivering application logs incrementally to a Kafka- and ZooKeeper-based log collection platform. Implemented a scalable data export application to fetch processed data from these platforms into consuming application databases. Worked on Hadoop, Hive, Oozie and MySQL customization for batch data and real-time platform setup.
    Led a team of 3 to deliver a widget framework for chatbots and a Facebook-like workspace on the [24]7 chat platform.
    Led a team of 4 to deliver a Hive-based weekly and monthly reporting framework using Amazon EMR. Solved sessionization of user data spread over 2 weeks and longer.
    Designed and developed a web-based rules engine for visitor tracking and behavior evaluation on websites. Implemented a client-side framework for user data collection, finite state machine and rules evaluation. Implemented core features for browser-side storage, cross-domain communication, widget design and flow orchestration based on HTML5.
  7. Senior S/W Engineer, Y! India R&D, Bangalore (Jul 2008-Jul 2009)

    Contributed to the design and development of Oozie, a workflow engine for managing and coordinating jobs running on Hadoop. Also worked on data acquisition and syndication from Reuters for Y! News, end-to-end on Hadoop-based internal platforms.
  8. Senior Member Technical Staff, Oracle India Server Technologies Division, Bangalore (May 2006-Jul 2008)

    Worked in the Oracle Web Conferencing team and delivered a server for streaming live web conferences using RTSP, SDP and RTP to Windows Media Player and QuickTime Player. Also developed a Java library for transcoding generic media content to ASF and FLV formats. Worked on the full development cycle of the product, including research (byte-level reverse engineering of then-unpublished specifications), design and development from scratch, sanity testing and performance improvement.
  9. Associate S/W Engineer, Computer Associates, Hyderabad (May 2005-May 2006)

    Worked in the "Platform Technologies Group" and ported existing CA applications to Unix (Solaris/HP-UX/AIX) platforms.
  10. Student Associate, TeleMedicine Lab, IIT Kharagpur (May-Jul 2003)

    Implemented TeleMedicine software for PocketPC. A paper was presented at Asia Pacific Telecommunity (APT) Telemedicine Workshop 2004.
  11. Student Associate, CET, IIT Kharagpur (Jan-Jun 2002)

    Implemented a "Digital Simulation Software" using Java and XML for remote digital circuit lab experiments. Circuits designed at the client side were simulated on the remote server using RMI, after proper user authentication.


Personal Summary


Last Updated: Aug 03 2022