About
I…
Articles by Anmol
Activity
-
The first week of a supply-chain incident is visible. The real blind spot is often the next 1–3 months, when attackers test whether any trust…
Liked by Anmol Rajpurohit
-
Want to lead a global team responsible for the most important product features – availability, reliability & security? Sumo Logic looking for a…
Liked by Anmol Rajpurohit
-
I am excited to join The Home Depot as Chief Technology Officer. Before starting this new chapter, I want to thank my colleagues at Ford. I am…
Liked by Anmol Rajpurohit
Experience & Education
Volunteer Experience
Publications
-
The Cardinal Sin of Data Mining and Data Science: Overfitting
Computer Society of India
Overfitting leads to the public losing trust in research findings, many of which turn out to be false. We examine some famous examples ("the decline effect", Miss America age) and suggest approaches for avoiding overfitting.
-
So you wanna be a data scientist? A guide to 2015's hottest profession
Mashable
Byte from an interview published in an article on Mashable.
-
Big Data for Business Managers: Bridging the gap between potential and value
IEEE BigData 2013
The research paper has been cited/referenced in over 40 research papers/books/articles.
Abstract: Given the surge of interest in research, publication and application on Big Data over the last few years, the potential of Big Data seems to be well established across businesses. However, in most business implementations Big Data still seems to be struggling to deliver the promised value (ROI). Such results, despite the use of market-leading Big Data solutions and talented deployment teams, are forcing business managers to think about what needs to be done differently.
This paper lays down the framework for business managers to understand Big Data processes. Besides providing a business overview of Big Data core components, the paper presents several questions that the managers must ask to assess the effectiveness of their Big Data processes.
This paper is based on the analysis of several Big Data projects that never delivered and comparison against successful ones. The hypothesis is developed based on public information and is proposed as the first step for business managers keen on effectively leveraging Big Data.
Keywords—Big Data, Analytics, Business Managers, Failure, Critical Errors, Potential, Value
Courses
-
Analysis of Programming Languages
CS 253
-
Big Data and Cloud Computing
CS 295
-
Data Structures
CS 261
-
Distributed Computer Systems
CS 230
-
Image Understanding
CS 216
-
Introduction to Artificial Intelligence
CS 271
-
Introduction to Embedded and Ubiquitous Systems
CS 244
-
Introduction to Machine Learning
CS 273a
-
Principles of Data Management
CS 222
Projects
-
Evolution of Artificial Neural Networks Over Distributed Environments
-
Paper Abstract:
During the last few decades, the fields of neural networks and massively parallel computing have evolved rapidly. Artificial Neural Networks have been studied since 1943, when Warren McCulloch and Walter Pitts wrote their first paper on how neurons might work. Since then there has been a tremendous amount of research on neural networks to simulate brain processing. Neural networks are computation-intensive, and therefore parallelization helps to reduce the training time significantly. In this paper we cover how neural networks evolved over time in distributed environments. We start with the early implementations of neural networks, where researchers began to leverage parallelism, and cover the recent advances in this direction that have made neural networks such a big success. Neural networks are widely used in natural language processing, image recognition, speech recognition, etc. Since neural networks generally contain a large number of layers, they are considered one of the prominent Deep Learning architectures.
-
Visual Dominance on Social Networks
-
Social media has become an important part of everyone's life today. Nearly two-thirds of American adults (65%) use social networking sites. The field has developed rapidly during the past decade. Earlier, social media was treated as just a medium to interact online with friends and family. However, it has evolved over the years and is now an integral part of the lives of not only individuals but also teams in large corporate organizations. Along the same lines, there has been constant development of tools and techniques to make the best use of social media for marketing, campaigning and other purposes. There are many tools available in the market that perform text analysis to extract hidden insights from large volumes of social media data. Text analysis lays the foundation for sentiment analysis, the art of understanding the sentiment behind text. However, with the evolution of social media, it now involves a large mass of media content such as pictures, videos, etc. Although everyone would agree with the expression by Tess Flanders, “Use a picture. It's worth a thousand words.”, there has not been any significant development in the analysis of media content such as images/videos on social networks. This project aimed to fill that missing piece by implementing an approach to analyze images on a particular social network.
-
Auto-Tune Your Guitar
-
This embedded systems project involved:
Use of an Android cell phone's microphone to record the sound that is played when a string is plucked.
The Android application processes the signal from the microphone to determine whether the string is in tune or the pitch is too low or too high.
The phone communicates with an Arduino board via Bluetooth.
The Arduino controls a servo module to rotate the guitar peg accordingly.
The above process is repeated until the correct pitch is achieved.
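The tuning loop described above can be sketched as follows. This is only an illustration under stated assumptions, not the project's actual code: it assumes a pitch estimate in Hz is already available from the microphone signal, compares it with standard-tuning string frequencies, and uses a hypothetical 5-cent tolerance and hypothetical command names (`TIGHTEN`, `LOOSEN`, `IN_TUNE`) for what might be sent to the servo side.

```python
import math

# Standard-tuning frequencies of the six guitar strings, in Hz.
STANDARD_TUNING_HZ = {
    "E2": 82.41, "A2": 110.00, "D3": 146.83,
    "G3": 196.00, "B3": 246.94, "E4": 329.63,
}

TOLERANCE_CENTS = 5.0  # assumed tolerance; not stated in the project description


def cents_off(measured_hz, target_hz):
    """Signed pitch error in cents (100 cents = one semitone)."""
    return 1200.0 * math.log2(measured_hz / target_hz)


def tuning_command(measured_hz, target_hz, tolerance=TOLERANCE_CENTS):
    """Decide what the peg should do (hypothetical command names)."""
    offset = cents_off(measured_hz, target_hz)
    if abs(offset) <= tolerance:
        return "IN_TUNE"
    # A pitch that is too low (negative offset) means the string must be tightened.
    return "TIGHTEN" if offset < 0 else "LOOSEN"


def simulate_tuning(start_hz, target_hz, step_cents=10.0, max_steps=100):
    """Repeat measure -> decide -> turn peg until the string is in tune.

    The string is simulated: each servo step is assumed to shift the pitch
    by a fixed number of cents, which a real string only approximates.
    """
    hz, steps = start_hz, 0
    while steps < max_steps:
        command = tuning_command(hz, target_hz)
        if command == "IN_TUNE":
            break
        direction = 1 if command == "TIGHTEN" else -1
        hz *= 2 ** (direction * step_cents / 1200.0)
        steps += 1
    return hz, steps


final_hz, steps = simulate_tuning(105.0, STANDARD_TUNING_HZ["A2"])
print(f"A string settled at {final_hz:.2f} Hz after {steps} servo steps")
```

Working in cents rather than raw Hz keeps the tolerance equally strict on the low and high strings, since pitch perception is logarithmic.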
-
Implementation and Analysis of Search Algorithms in Connect Four Game
-
Report summary (can be shared on request):
Adopted an open-source implementation of the Connect Four game.
Worked on making the computer a strong opponent against a human player.
Applied various mechanisms and analyzed their impact on the analysis of the game tree and moves.
Performed a comparative analysis of various approaches, ranging from brute force to smart heuristics.
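The summary does not name the search algorithms that were compared, so the following is only one plausible example of a heuristic approach: a depth-limited minimax search with alpha-beta pruning. The board encoding, the scoring heuristic and all function names are assumptions made for illustration.

```python
import math

ROWS, COLS = 6, 7
EMPTY, HUMAN, COMPUTER = 0, 1, 2


def new_board():
    return [[EMPTY] * COLS for _ in range(ROWS)]


def valid_moves(board):
    # Try centre columns first; good move ordering makes pruning more effective.
    order = sorted(range(COLS), key=lambda c: abs(c - COLS // 2))
    return [c for c in order if board[0][c] == EMPTY]


def drop(board, col, player):
    """Return a copy of the board with the player's disc dropped into col."""
    copy = [row[:] for row in board]
    for r in range(ROWS - 1, -1, -1):  # the bottom row has the highest index
        if copy[r][col] == EMPTY:
            copy[r][col] = player
            return copy
    raise ValueError("column is full")


def windows(board):
    """Yield every line of 4 cells: horizontal, vertical and both diagonals."""
    for r in range(ROWS):
        for c in range(COLS):
            for dr, dc in ((0, 1), (1, 0), (1, 1), (1, -1)):
                if 0 <= r + 3 * dr < ROWS and 0 <= c + 3 * dc < COLS:
                    yield [board[r + i * dr][c + i * dc] for i in range(4)]


def winner(board):
    for w in windows(board):
        if w[0] != EMPTY and w.count(w[0]) == 4:
            return w[0]
    return None


def heuristic(board):
    """Assumed heuristic: reward lines that only one player can still complete."""
    score = 0
    for w in windows(board):
        if HUMAN not in w:
            score += w.count(COMPUTER) ** 2
        if COMPUTER not in w:
            score -= w.count(HUMAN) ** 2
    return score


def minimax(board, depth, alpha, beta, maximizing):
    """Return (score, best_column) from the computer's point of view."""
    won = winner(board)
    if won == COMPUTER:
        return 10_000 + depth, None  # prefer faster wins
    if won == HUMAN:
        return -10_000 - depth, None
    moves = valid_moves(board)
    if not moves:
        return 0, None  # board full: draw
    if depth == 0:
        return heuristic(board), None
    best_col = moves[0]
    best = -math.inf if maximizing else math.inf
    for col in moves:
        player = COMPUTER if maximizing else HUMAN
        score, _ = minimax(drop(board, col, player), depth - 1, alpha, beta, not maximizing)
        if maximizing and score > best:
            best, best_col = score, col
        elif not maximizing and score < best:
            best, best_col = score, col
        if maximizing:
            alpha = max(alpha, best)
        else:
            beta = min(beta, best)
        if alpha >= beta:
            break  # the opponent would never allow this branch
    return best, best_col


# Usage: the human threatens four in a row along the bottom; the search blocks it.
board = new_board()
for c in (0, 1, 2):
    board = drop(board, c, HUMAN)
print(minimax(board, 4, -math.inf, math.inf, True)[1])
```

A brute-force baseline corresponds to the same search without the `alpha >= beta` cut-off and without a depth limit, which is what makes the comparison between the two ends of the spectrum meaningful.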
-
Relational Database Management System Design and Implementation
-
Built a relational database system from scratch in C++ (the source code for the project is private, as required by the course policy; access can be provided on request):
Layer 0: Implemented a Paged File (PF) system and a few operations of a Record-Based File (RBF) manager.
Layer 1: Finished all the operations of the RBF manager and implemented a Relation Manager (RM) on top of the basic record-based file system.
Layer 2: Implemented an Indexing (IX) component which provides classes and methods for managing persistent indexes over unordered data records stored in files. Built indexes using a top-down B+ Tree approach.
Layer 3: Implemented a query engine which provided functions such as Filter, Project, Join (Grace Hash Join) and Aggregate.
-
Predicting rainfall using ensemble of Ensembles
-
In-class competition held on Kaggle (http://inclass.kaggle.com/c/how-s-the-weather)
Presented my experiments with multiple approaches on the weather data, ranging from ensembles of learners to deep learning networks, to predict the rainfall.
Results for a weighted ensemble of learners placed me in the top 10% of the class, with a test root-mean-squared error of 0.5741.
-
A Survey on Geography Based Routing in Mobile Ad Hoc Networks
-
Researched the currently popular approaches to enhancing MANET (Mobile Ad hoc NETwork) performance through location services. A comprehensive qualitative comparison was performed and documented in the research paper.
Key concerns with the existing protocols were identified and listed as opportunities for future research.
Simulation tool: NS2
-
URL Ranking Algorithm for Web Crawler Optimization
-
Designed an efficient algorithm to assist a web crawler through prioritization and optimization of the input URL set. The algorithm was tested against URL Reputation Data Set (a set of 16,000 URLs with over 3 million attributes). The design included data selection, preprocessing, entropy calculations and finally, information gain comparisons to deliver the list of selected URLs, sorted in order of importance.
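The entropy and information-gain steps mentioned above can be illustrated with a minimal sketch. The toy feature names and labels below are hypothetical, and how the original algorithm turned these scores into a final URL ordering is not described, so the sketch stops at ranking attributes by information gain.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """Reduction in label entropy obtained by splitting on one feature."""
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    total = len(labels)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def rank_features(features, labels):
    """Return (feature name, gain) pairs sorted from most to least informative."""
    gains = {name: information_gain(values, labels) for name, values in features.items()}
    return sorted(gains.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: 1 = URL worth crawling first, 0 = not.
labels = [1, 1, 0, 0]
features = {
    "has_known_host": [1, 1, 0, 0],   # separates the classes perfectly
    "path_is_long":   [1, 0, 1, 0],   # carries no information about the label
}
for name, gain in rank_features(features, labels):
    print(f"{name}: {gain:.3f} bits")
```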
Development platform: C/C++, MATLAB
-
Smart Document Management System
-
Developed a document management system that personalizes the document search results to the user’s activities, making the search results much more relevant. The software reads the PDF files, extracts keywords, prepares an index, records user activity and delivers search results through a smart integration of all the information.
Development platform: Java (using the DynamicPDF class library)
-
Online Goods Tracking for Shipping Corporation
-
Built the E-R model, data schema and database application for tracking goods at shipping ports and during transportation. The application enabled real-time monitoring and management of goods across various stages of transportation – right from pick-up to the ultimate delivery. Different user roles were created with varying access to data and management tools.
Development platform: MySQL, PHP, HTML, CSS
-
Simulator for CPU scheduling algorithms
-
Designed and developed a simulator program that runs different CPU scheduling algorithms and produces performance metrics including CPU utilization, the waiting time of each process and the average waiting time, the response time of each process and the average response time, and the turn-around time of each process and the average turn-around time.
Development platform: C++
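As an illustration of the metrics listed above, here is a minimal sketch for one algorithm, first-come, first-served (FCFS). The original simulator was written in C++ and its set of algorithms is not listed; the `Process` fields, the function name and the definition of CPU utilization (busy time divided by the time at which the last process finishes, counted from time 0) are assumptions.

```python
from dataclasses import dataclass


@dataclass
class Process:
    pid: str
    arrival: int  # time the process enters the ready queue
    burst: int    # CPU time the process needs


def simulate_fcfs(processes):
    """Run processes in arrival order without preemption and collect metrics."""
    clock = 0
    busy = 0
    rows = []
    for p in sorted(processes, key=lambda proc: proc.arrival):
        start = max(clock, p.arrival)   # the CPU may sit idle until arrival
        finish = start + p.burst
        rows.append({
            "pid": p.pid,
            "waiting": start - p.arrival,
            # Without preemption the first response coincides with the start.
            "response": start - p.arrival,
            "turnaround": finish - p.arrival,
        })
        busy += p.burst
        clock = finish
    count = len(rows)
    return {
        "per_process": rows,
        "avg_waiting": sum(r["waiting"] for r in rows) / count if count else 0.0,
        "avg_response": sum(r["response"] for r in rows) / count if count else 0.0,
        "avg_turnaround": sum(r["turnaround"] for r in rows) / count if count else 0.0,
        "cpu_utilization": busy / clock if clock else 0.0,
    }


result = simulate_fcfs([Process("P1", 0, 5), Process("P2", 1, 3), Process("P3", 2, 8)])
for row in result["per_process"]:
    print(row)
print(f"average waiting time: {result['avg_waiting']:.2f}")
print(f"CPU utilization: {result['cpu_utilization']:.0%}")
```

Other policies such as shortest-job-first or round-robin would change only the order in which processes are picked and, for preemptive policies, make response time differ from waiting time.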
Honors & Awards
-
Runner Up - Salesforce Labs Service Cloud Contest
Salesforce
Won second place in the company-wide "Salesforce Labs Service Cloud Contest" in 2017. Showcased the app at Salesforce's annual conference, Dreamforce 2017. The work was applauded, and Salesforce customers showed keen interest.
App is live on Salesforce app exchange: https://appexchange.salesforce.com/appxListingDetail?listingId=a0N3A00000EJaYZUA1
The app helps Case agents set an "Out of Office" entry, which automatically transfers their pending or new incoming cases to either a queue or a backup agent. It also supports transferring active cases back once the agent resumes work.
-
Finalist at Google Code for India Global Hackathon 2014
Code for India
Worked on a very interesting food requirement simulation/prediction problem. Along with two UC Berkeley graduate students, developed an innovative solution/application that predicts the quantity of food required in each operational region based on historical data. The solution to this problem was sought by Akshaya Patra, a non-profit organization in India that runs a school lunch programme across India, serving more than 1.2 million meals daily through its network in 9 states. Before this development, each class teacher gave a daily predicted count of the students who would attend the next day, and the non-profit prepared and served food based on that count. The application prototype had the following functionalities:
* Predicted meal servings to be delivered
* Feedback from teacher on number of meals served yesterday
* Trend of food requirement
* Time of food delivery
* Update calendar for some specific holidays/events for that particular school
* Comment on a particular food item from teacher
* Recent delivery details such as route number, number of servings, etc.
* Communicate with teachers through a message delivery system
-
Runner-up at UCLA Developer's Contest 2014
UCLA Anderson School of Management Entrepreneur Association
Novel analytics solution for online education was the runner-up at UCLA Developer's Contest 2014
-
Yahoo Hack USA
Yahoo
The two-day hackathon celebrated collaboration, learning and innovation using the world’s top technologies from both YDN and tech partners. Selected developers from across the USA were invited to come together and build hacks, learn from experts and play with tech and interactive games.
-
Winner of Google Online Marketing Challenge 2013
Google
The goal was to create an effective online marketing campaign in Google AdWords. Won an Android tablet as a token of appreciation, along with a certificate.
-
Regional Finalist ACM ICPC - Amritapuri Region 2011-12
ACM International Collegiate Programming Contest
Represented the university at the Regional Finals of the ACM International Collegiate Programming Contest 2011.
-
Best Cub Reporter
Dainik Bhaskar Group
Won the “Best Cub-Reporter” Gold Medal for being the best reporter of a weekly school news supplement of Dainik Bhaskar, an Indian Hindi-language daily newspaper owned by the Dainik Bhaskar Group.
Languages
-
English
Full professional proficiency
-
Hindi
Native or bilingual proficiency
Recommendations received
14 people have recommended Anmol
More activity by Anmol
-
A VP of Engineering told me last month: "We can build it ourselves. AI makes development fast now. Why buy when we can build?" I asked her one…
Liked by Anmol Rajpurohit
-
The upcoming models are about to give attackers immense capabilities, as AI arms attackers with their most powerful weapons to date, research labs…
Liked by Anmol Rajpurohit
-
Just wrapped RSAC. Still processing. AI agent security was everywhere this year — and one stat kept coming up: 82% of executives think their existing…
Liked by Anmol Rajpurohit
-
Some personal news: this week marks my final week at SoFi. Reflecting on the last three years, the clear highlight has been the people and I am…
Liked by Anmol Rajpurohit
-
Enterprise adoption isn't about features. It's about risk. In enterprise products, new features get the attention. But enterprise readiness is what…
Liked by Anmol Rajpurohit
-
The hardest part of being a first-time SDM — is not what you think. It isn’t the obvious weight of ownership. Nor the long hours, or the relentless…
Liked by Anmol Rajpurohit