San Francisco Bay Area
16K followers 500+ connections

Join to view profile

About

Strategic Engineering Leader | SRE, Platform & Observability at Petabyte Scale

I…

Articles by Anmol

Activity

Join now to see all activity

Experience & Education

  • Splunk

View Anmol’s full experience

See their title, tenure and more.

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Volunteer Experience

  • Year Up Graphic

    Year Up Mentor

    Year Up

    - 2 years 5 months

    Education

Publications

  • The Cardinal Sin of Data Mining and Data Science: Overfitting

    Computer Society of India

    Overfitting leads to public losing trust in research findings, many of which turn out to be false. We examine some famous examples, "the decline effect", Miss America age, and suggest approaches for avoiding overfitting.

    Other authors
    See publication
  • So you wanna be a data scientist? A guide to 2015's hottest profession

    Mashable

    Byte from interview published in article on Mashable.

    See publication
  • Big Data for Business Managers: Bridging the gap between potential and value

    IEEE BigData 2013

    The research paper has been cited/referenced in over 40 research papers/books/articles.

    Abstract — Given the surge of interest in research, publication and application on Big Data over the last few years, the potential of Big Data seems to be well-established now across businesses. However, in most of the business implementations Big Data still seem to be struggling to deliver the promised value (ROI). Such results despite using the market leading Big Data solutions and talented…

    The research paper has been cited/referenced in over 40 research papers/books/articles.

    Abstract — Given the surge of interest in research, publication and application on Big Data over the last few years, the potential of Big Data seems to be well-established now across businesses. However, in most of the business implementations Big Data still seem to be struggling to deliver the promised value (ROI). Such results despite using the market leading Big Data solutions and talented deployment team are forcing the business managers to think what needs to be done differently.

    This paper lays down the framework for business managers to understand Big Data processes. Besides providing a business overview of Big Data core components, the paper presents several questions that the managers must ask to assess the effectiveness of their Big Data processes.
    This paper is based on the analysis of several Big Data projects that never delivered and comparison against successful ones. The hypothesis is developed based on public information and is proposed as the first step for business managers keen on effectively leveraging Big Data.

    Keywords—Big Data, Analytics, Business Managers, Failure, Critical Errors, Potential, Value

    See publication

Courses

  • Analysis of Programming Languages

    CS 253

  • Big Data and Cloud Computing

    CS 295

  • Data Structures

    CS 261

  • Distributed Computer Systems

    CS 230

  • Image Understanding

    CS 216

  • Introduction to Artificial Intelligence

    CS 271

  • Introduction to Embedded and Ubiquitous Systems

    CS 244

  • Introduction to Machine Learning

    CS 273a

  • Principles of Data Management

    CS 222

Projects

  • Evolution of Artificial Neural Networks Over Distributed Environments

    -

    Paper Abstract:
    During the last few decades the field of neural networks and massive parallel computing has evolved rapidly. Artificial Neural Networks have been studied since 1943, when Warren McCulloch and Walter Pitts wrote their first paper on how neurons might work. Since then there has been a tremendous research work on neural networks to simulate brain processing. Neural networks are computation intensive and therefore, parallelization helps to reduce the training time significantly.…

    Paper Abstract:
    During the last few decades the field of neural networks and massive parallel computing has evolved rapidly. Artificial Neural Networks have been studied since 1943, when Warren McCulloch and Walter Pitts wrote their first paper on how neurons might work. Since then there has been a tremendous research work on neural networks to simulate brain processing. Neural networks are computation intensive and therefore, parallelization helps to reduce the training time significantly. In this paper we would cover how neural networks evolved over time in distributed environments. We would start with the early implementations of neural networks, where researchers started to leverage parallelism and cover the recent advances in this direction that has made neural networks such a big success. Neural networks are widely used in the field of natural language processing, image recognition, speech recognition, etc. Since neural networks generally contain a large number of layers it has been considered as one of the prominent Deep Learning architectures.

    Other creators
  • Visual Dominance on Social Networks

    -

    Social media has become an important part of lives of everyone today. Nearly two-thirds of American adults (65%) use social networking sites. There has been a rapid development in this field during the past decade. Earlier, it was treated just a medium to interact online with friends and family. However, it has evolved over the years and now it is an integral part of not only individuals but also teams sitting in large corporate organizations. On the same line, there has been a constant…

    Social media has become an important part of lives of everyone today. Nearly two-thirds of American adults (65%) use social networking sites. There has been a rapid development in this field during the past decade. Earlier, it was treated just a medium to interact online with friends and family. However, it has evolved over the years and now it is an integral part of not only individuals but also teams sitting in large corporate organizations. On the same line, there has been a constant development of tools and techniques to leverage social media at its best for marketing, campaigning and other purposes. There are umpteen tools available in the market, which perform text analysis to get hidden insights from mass chunk of social media data. Text analysis lays foundation for sentiment analysis, an art to understand the sentiment from text. However, with the evolution of social media, now it involves a large mass of media content such as pictures, videos, etc. Although everyone would agree with the expression by Tess Flanders that “Use a picture. It's worth a thousand words.” there has not been any significant development of analysis of media content such as images/videos on social networks. In this paper This project aimed towards filling that missing piece by implementing an approach to analyze images on a particular social network.

  • Auto-Tune Your Guitar

    -

    This embedded systems project involved:
    Use of an android cell phone’s microphone to record sound that is played when a string is plucked.
    Android application processes the signal from microphone to determine if the string is in tune or if the pitch is too low/high.
    Phone communicates to an Arduino board via bluetooth.
    Arduino controls a servo module to rotate the guitar peg accordingly.
    The above process is repeated until the correct pitch is achieved.

    Other creators
    • Sang Van Hai
    • Danny Armenta
    • Nil Mamano
    See project
  • Implementation and Analysis of Search Algorithms in Connect Four Game

    -

    Report Summary(can be shared on request):
    Adopted open-source implementation of Connect Four Game.
    Worked on enabling computer as a strong opponent against human player.
    Applied various mechanisms and analyzed their impact on the analysis of game tree and moves.
    Performed a comparative analysis of various approaches starting from brute force to smart heuristics.

    Other creators
    • Krish Furia
    • Harsh Bahua
    See project
  • Relational Database Management System Design and Implementation

    -

    Built a relational database system from scratch in C++ (The source code for the project is private as required by the course policy. If you are interested, I'll provide access based on your request):

    Layer 0: Implemented a Paged File (PF) system and few operations of a Record-Based File (RBF) manager.
    Layer 1: Finished all the operations of the RBF manager and implemented a Relation Manager (RM) on top of the basic record-based file system.
    Layer 2: Implemented an Indexing (IX)…

    Built a relational database system from scratch in C++ (The source code for the project is private as required by the course policy. If you are interested, I'll provide access based on your request):

    Layer 0: Implemented a Paged File (PF) system and few operations of a Record-Based File (RBF) manager.
    Layer 1: Finished all the operations of the RBF manager and implemented a Relation Manager (RM) on top of the basic record-based file system.
    Layer 2: Implemented an Indexing (IX) component which provides classes and methods for managing persistent indexes over unordered data records stored in files. Built indexes using Top-down B+ Tree approach.
    Layer 3: Implemented a query engine which provided functions such as Filter, Project, Join (Grace Hash Join) and Aggregate.

  • Predicting rainfall using ensemble of Ensembles

    -

    In-class competition held on Kaggle (http://inclass.kaggle.com/c/how-s-the-weather)

    Presented my experiments with multiple approaches, ranging from Ensemble of Learning to Deep Learning Networks on the weather data to predict the rainfall.

    Results for weighted ensemble of learners landed me in top 10% of class, with the testing root-mean-squared error being 0.5741.

  • A Survey on Geography Based Routing in Mobile Ad Hoc Networks

    -

    Researched the current popular approaches to enhance MANet (Mobile Ad hoc NETworks) performance through location services. A comprehensive qualitative comparison was performed and documented in the research paper.
    Key concerns with the existing protocols were identified and listed as opportunities for future research.
    Simulation tool: NS2

    Other creators
  • Implementation of Distributed Databases

    -

    Designed, implemented and tested a real-time distributed database by leveraging tools and applying techniques. Used Oracle Database 10g Express Edition

    Other creators
  • URL Ranking Algorithm for Web Crawler Optimization

    -

    Designed an efficient algorithm to assist a web crawler through prioritization and optimization of the input URL set. The algorithm was tested against URL Reputation Data Set (a set of 16,000 URLs with over 3 million attributes). The design included data selection, preprocessing, entropy calculations and finally, information gain comparisons to deliver the list of selected URLs, sorted in order of importance.
    Development platform: C/C++, Matlab

    Other creators
  • Smart Document Management System

    -

    Developed a document management system that personalizes the document search results to the user’s activities, making the search results much more relevant. The software reads the PDF files, extracts keywords, prepares an index, records user activity and delivers search results through a smart integration of all the information.
    Development platform: Java (Using DynamicPDF class library)

  • Online Goods Tracking for Shipping Corporation

    -

    Built the E-R model, data schema and database application for tracking goods at shipping ports and during transportation. The application enabled real-time monitoring and management of goods across various stages of transportation – right from pick-up to the ultimate delivery. Different user roles were created with varying access to data and management tools.
    Development platform: MySQL, PHP, HTML, CSS

  • Simulator for CPU scheduling algorithms

    -

    Designed and developed a simulator program that runs different CPU scheduling algorithms and produces utilization matrices including CPU utilization, waiting time of each process and average waiting time, response time of each process and average response time, turn-around time of each process and turn-around waiting time.
    Development platform: C++

Honors & Awards

  • Runner Up - Salesforce Labs Service Cloud Contest

    Salesforce

    Won second place in company wide "Salesforce Labs Service Cloud Contest" in 2017. Showcased the Salesforce app in Salesforce Annual Conference, Dreamforce 2017. The work got applauded and Salesforce customers showed keen interest.
    App is live on Salesforce app exchange: https://appexchange.salesforce.com/appxListingDetail?listingId=a0N3A00000EJaYZUA1

    App helps Case agents to set an "Out of Office" entry which automatically transfers their pending or new incoming cases to either a…

    Won second place in company wide "Salesforce Labs Service Cloud Contest" in 2017. Showcased the Salesforce app in Salesforce Annual Conference, Dreamforce 2017. The work got applauded and Salesforce customers showed keen interest.
    App is live on Salesforce app exchange: https://appexchange.salesforce.com/appxListingDetail?listingId=a0N3A00000EJaYZUA1

    App helps Case agents to set an "Out of Office" entry which automatically transfers their pending or new incoming cases to either a queue or a backup agent. Feature support to transfer active cases back once agent resumes work.

  • Finalist at Google Code for India Global Hackathon 2014

    Code for India

    Worked on very interesting food requirement simulation/prediction problem. Along with two UC Berkeley Graduate students developed an innovation solution/application that predicts the quantity of food required in each operational region based on the historical data. Solution to this problem was sought by Akshaya Patra, a non-profit organization in India that runs school lunch programme across India serving more than 1.2 million meals daily through their network in 9 states in India. Before this…

    Worked on very interesting food requirement simulation/prediction problem. Along with two UC Berkeley Graduate students developed an innovation solution/application that predicts the quantity of food required in each operational region based on the historical data. Solution to this problem was sought by Akshaya Patra, a non-profit organization in India that runs school lunch programme across India serving more than 1.2 million meals daily through their network in 9 states in India. Before this development, each day class teacher was giving a predicted count of students that would be attending next day and based on that the non profit prepares and serves food. The application prototype had following functionalities:
    * Predicted meal servings to be delivered
    * Feedback from teacher on number of meals served yesterday
    * Trend of food requirement
    * Time of food delivery
    * Update calendar for some specific holidays/events for that particular school
    * Comment on a particular food item from teacher
    * Recent delivery details such as route number, number of servings, etc.
    * Communicate with teachers through message delivery system

  • Runner-up at UCLA Developer's Contest 2014

    UCLA Anderson School of Management Entrepreneur Association

    Novel analytics solution for online education was the runner-up at UCLA Developer's Contest 2014

  • Yahoo Hack USA

    Yahoo

    The two-day hackathon celebrated collaboration, learning and innovation using the world’s top technologies from both YDN and tech partners. Selected developers were invited across USA to come together and build hacks, learn from experts and play with tech and interactive games.

  • Winner of Google Online Marketing Challenge 2013

    Google

    The goal was to create an effective online marketing campaign in Google AdWords. Won Android tablet as a token of appreciation along with the certificate.

  • Regional Finalist ACM ICPC - Amritapuri Region 2011-12

    ACM International Collegiate Programming Contest

    Represented University at Regional Finals in ACM International Collegiate Programming Contest 2011.

  • Best Cub Reporter

    Dainik Bhaskar Group

    Won “Best Cub-Reporter” Gold Medal for being the best reporter of a weekly school news supplement of Dainik Bhaskar, Indian Hindi-language daily newspaper owned by the Dainik Bhaskar Group.

Languages

  • English

    Full professional proficiency

  • Hindi

    Native or bilingual proficiency

Recommendations received

More activity by Anmol

View Anmol’s full profile

  • See who you know in common
  • Get introduced
  • Contact Anmol directly
Join to view full profile

Other similar profiles

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Add new skills with these courses