Example: Segmentation Using Clustering
Who is the best target for a cross-sell/up-sell campaign (an example from SAS Enterprise Miner)?
A consumer bank wants to segment its customers based on historic usage patterns to identify those who might benefit from new product offerings. Some customers might prefer an offer of a low-interest loan whereas others might prefer more convenience in online banking opportunities.
In order to identify customer usage patterns, the bank decides to segment its customers based on historic data about products that they hold. Segmentation was to be used for improving contact strategies in the Marketing Department.
Example: Segmentation Using Clustering
A sample of 100,000 active consumer customers was selected. An active consumer customer was defined as an individual or household with at least one checking account and at least one transaction on the account during a three-month study period.
All transactions during the three-month study period were recorded and classified into one of four activity categories:
1. traditional banking methods (TBM), 2. automatic teller machine (ATM), 3. point of sale (POS), 4. customer service (CSC).
Example: Segmentation Using Clustering
A three-month activity profile for each customer was developed by combining historic activity averages with observed activity during the study period. Historically, for one CSC transaction, an average customer would conduct two POS transactions, three ATM transactions, and ten TBM transactions.
Each customer was assigned this initial profile at the beginning of the study period. The initial profile was updated by adding the total number of transactions in each activity category over the entire three-month study period.
Segments After Applying Clustering
Segments After Applying Clustering
An Example from Machine Learning
Video - Embracing Uncertainty: Applied Machine Learning Comes of Age
Text Analytics and Text Mining
- 85% percent of all corporate data is captured and stored in some kind of unstructured form such as text and doubling in size every 18 months (Merrill Lynch and Gartner)
- Have to analyze these text information
- Text Analytics and Text Mining
- Text Mining: a semi-automated process of extracting knowledge from unstructured data sources (knowledge discovery in textual databases)
- Text Analytics = Information Retrieval + Information Extraction + Data Mining + Web Mining
- Text Analytics = Information Retrieval + Text Mining
Applications Areas and Disciplines
Data Mining vs Text Mining
- Semi-automated processes
- Discovering novel and usefule patterns
- Data mining is applied to structured data, e.g., in databases.
- Text mining is applied to unstructured data such as Word documents, PDF files, text excerpts, HTML/XML files, etc.
- In text mining, need to impose structure to the data and then mine the structured data.
Text Mining Applications
- Security - ECHELON surveillance system, EUROPOL's OASIS (Overall Analysis System for Intelligence Support) to track transnational organized crime, deception detection
- E-mail spam filtering, automatic response generation, prioritization and categorization
- Finance (quarterly reports)
- Medicine (discharge summaries)
- Marketing (better CRM by mining customer comments)
- Law (mining legal texts) - e.g., 92% of the supreme court cases are appeals of a non-constitutional nature
Text Mining Example - Deception Detection
- Difficult
- More difficult if limited to only text
- A study analyzed text-based testimonies of person of interests at military bases and used only text-based features (cues)
- Example cues: verb count, noun-phrase count, average number of clauses and sentence length to indicate complexity, modifiers and modal verbs to indicate uncertainty, passive voice and objectification to indicate nonimmediacy, typographical error ratio to indicate informality, ...
- 371 usable statements are generated and 31 features are used
- Results (overall % accuracy): Logistic regression 67.28, Decision trees 71.60, Neural networks 73.46
Text-Based Deception-Detection Process
Context Diagram for Text Mining Process
Three-Step/Task Text Mining Process
Term-by-Document Matrix Example
Sentiment Analysis
- Sentiment: belief, view, opinion, and conviction
- Answer "What do people feel about a certain topic?"
Analyzing data related to opinions of many using a variety of automated tools
- One important application of Sentiment Analysis is CRM: customers/consumers' opinions
- Other applications: voice of employee, voice of the market, brand management, financial Markets, politics, government intelligence, etc.
Sentiment Analysis Process
P-N Polarity and S-O Polarity
Methods for Polarity Identification
- Using a Lexicon - WordNet (wordnet.princeton.edu) is a general-purpose lexicon database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept. An extension of WordNet includes polarity (Positive-Negative; P-N) and objectivity (Subjective-Objective, S-O) labels for each term in the lexicon.
- Using a Collection of Training Documents - perform sentiment classification using statistical analysis and machine learning tools based on the vast resources of labeled (manually by annotators or using a star/point system) documents such as RottenTomatoes, Internet Movie Database, Amazone, C-NET, eBay.
4. Digital Analytics
- Web Mining
- Search Engines
- Web Analytics
- Social Network Analysis
- Social Media Analytics
Digital Analytics
- Digital Analytics models data collected in interactive channels such as web, social media, and mobile to help make strategic business decisions and maintain a competitive edge.
- Need methods from descriptive and predictive analytics including data mining and text mining described before and other methods
- Web contains a lot of data in HTML, XML, text format
- Web mining is the process of discovering intrinsic relationships from textual/linkage/usage data on the web
Web Mining
Web Content and Structure Mining
- Mining Web's textual content data collected using Web crawlers
- Components of web pages: hyperlinks, authoritative pages, hubs (a collection of links to authoritative pages)
Web Usage Mining or Web Analytics
- Extract information from data generated through Web page visits and transactions (server access logs, referrer logs, agent logs, and client-side cookies, user characteristics, usage profiles, metadata such as page attributes, content attributes, and usage data)
- Clickstream data analysis
- Understand user behavior!
Clickstream Data Analysis
Web Usage Mining Applications
- Determine the lifetime value of clients
- Design cross-marketing strategies across products
- Evaluate promotional campaigns
- Target electronic ads and coupons at user groups based on user access patterns
- Predict user behavior based on previously learned rules and users' profiles
- Present dynamic information to users based on their interests and profiles, etc.
Search Engine
Ranking for Xbox & Bing
Ranking for Xbox & Bing
Search Engine Optimization
- Intentionally increase the visibility of an e-commerce site or a Web site in a search engine's natural (unpaid or organic) search results
- Internet marketing strategy
- Content, HTML, keywords, external links, Indexing (Webmaster submission of URL, proactively and continuously crawling the Web)
Web Analytics Metrics
- Website usability (page views, time on site, downloads, click paths)
- Traffic sources (referral websites, search engines, direct, offline/online campaigns)
- Visitor profiles (keywords, content groupings, geography, time of day, landing page profiles)
- Conversion statistics (new visitors, returning visitor, leads, sales/conversions, abandonment/exit rate)
A Web Analytics Dashboard
Web Mining
An Example from My Own Research
Study customer behavior using
Taobao's P4P
Data
- Is there a relationship between the number of product advertisements shown in response to a search and the distribution of prices of products advertised?
- Does the coefficient of variation of prices shown in advertisements have a relationship with the number of advertisements when the search data is grouped into two groups based on whether a search keyword(s) is present or not?
- Does the number of advertisements shown depend on whether one initiates the search with a keyword?
- Does the amount of time spent by the customer before clicking any advertisement, if that happens, depend on whether search keyword is present or not? What moderating variables are involved in this relationship? And many other research questions ...
Social Network Analysis
- Interdisciplinary field of Social psychology, Sociology, Statistics, Graph theory
- Study relationships between individuals, groups, organizations, societies
- Communication networks, community networks, criminal networks, innovation networks, ...
- Metrics: connection (multiplexity, network closure, etc.), distribution (bridge, centrality, density, etc.), segmentation (cliques and social circles, clustering coefficient, cohesion)
Social Media Analytics
- Social media: 1. collaborative projects (Wikipedia), 2. blogs and microblogs (Twitter), 3. content communities (YouTube), 4. social networking sites (Facebook), 5. virtual game worlds (World of Warcraft), 6. virtual social worlds (Second Life)
- Study relationships between individuals, groups, organizations, societies
- For business: the systematic and scientific ways to consume the vast amount of content created by Web-based social media outlets, tools, and techniques for the betterment of an organization's competitiveness
- Tools to measure social media impact: Descriptive and Predictive Analytics, Social Network Analysis
Sentiment Analysis using R and Twitter
PREREQUISITES:
- R and RStudio
- R libraries: twitteR, RCurl, RJSONIO, stringr, bitops, httr, tm, wordcloud, ggplot2, plyr, gridExtra, RSentiment
- twitter account
- Consumer Key (API key) & Consumer Secret (API Secret) - https://apps.twitter.com
- Access Token & Token Secret
- R code
Word Cloud - Visual Representation of Text Data
Another Example: Web Scraping
5. Prescriptive Analytics
- Prescriptive Analytics follows Descriptive and Predictive Analytics which produce results like choice alternative
- What's the best possible business decision?
- Optimization and Simulation
- An Example: A brand manager for ColCal Products must determine how much time to allocate between radio and television advertising during the next month. Market research has provided estimates of the audience exposure for each minute of advertising in each medium, which it would like to maximize.
Costs per minute of advertising are also known, and the manager has a limited budget of $25, 000: Exposure per minute (radio 350, TV 800); Cost per minute (radio $400, TV $2,000). The manager has decided that because television ads have been found to be much more effective than radio ads, at least
70% of the time should be allocated to elevision. (Solution using Excel's Solver)
Optimize Sales Planning at NBC
- NBC wants to maximize the revenues for the available fixed amount of advertising slots every May for the following television broadcast year starting in the third week of September. Typical client request: dollar amount, demographic interested, program mix, weekly weighting, unit-length distribution, negotiated cost per 1,000 viewers
- Solution: NBC developed a linear programming model to sell the available advertising slots in a optimal manner by minimizing the amount of premium ad slots assigned to a plan and the total penalty incurred in meeting management's goals while meeting constraints on available ad slots, airtime availability, product conflicts, client requirements, budget, show-mix, weekly weighting, and unit-mix.
- Decision Variables: the numbers of commercials of each spot length requested by the client that are to be placed in the shows and weeks included in the sales plan
- Objective Function: total value of ad slots assigned to the sales plan and the penalties incurred in not meeting the client requirements these systems have provided
Categories Of Models
Components Of Models
Optimization Models
- Too Many: Assignment (best matching of objects), Dynamic programming, Goal programming, Investment (maximizing rate of return), Linear and integer programming, Network models for planning and scheduling, Nonlinear programming, Replacement (capital budgeting), Inventory models (e.g., economic order quantity), Transportation (minimize cost of shipments)
- Most of these models are from the discipline Operations Research started in World War II (e.g., reduce the number of anti-aircraft artillery rounds needed to shoot down an enemy aircraft from an average of over 20,000 at the start of the Battle of Britain to 4,000 in 1941)
- Microsoft Excel's Solver is very capable and there are other optimization tools
- Example: Airline Scheduling
Simulation
- "Appearance" of reality
- Conduct experiments with a computer on a comprehensive model of a complex (too complex for numerical optimization) system or problem to assess its dynamic behavior
- Probabilistic Simulation: one or more of the independent variables (e.g., the number of passengers waiting at an airport) follow certain probability distributions
- Monte Carlo Simulation: obtain numerical results through repeated random sampling
- Discrete Event Simulation: model a system where the interaction between different entities is studied, e.g., customer queuing system
- Arena Discrete Event Simulation
- Lots of probability and statistics and respect "Variation, Variation, Variation" in statistics (Example)
Simulation Example using Arena - Airport Security
Answer the following questions from the Category Overview Report on this airport security analysis process simulation from
Arena, a discrete event simulation software:
- On average, how long did passengers spend in the modeled process?
- What was the average cost of reviewing a passenger's identification?
- What was the longest time a passenger spent in the process?
- What was the maximum number of passengers waiting for identification check?
- What proportion of time was the security officer busy?
6. Emerging Trends
- Data Science and Data Scientist
- Internet of Things (IoT) and Business Analytics
- Cloud Computing and Business Analytics
- Location-Based Analytics
Data Science and Data Scientist
- D. J. Patil of LinkedIn credited for creating the term "data science"
- Data scientist responsible for predictive analysis, statistical analysis, and more advanced analytical tools and algorithms (data mining, knowledge discovery, or machine learning)
- Write code for data cleaning/analysis in Web-oriented languages like Java or Python and statistical languages such as R
- Significant expertise in statistics (modeling, designed experiments, analysis), and also in operations research (optimization, simulation, etc.)
- Same knowledge and skills described for Business Analytics
- Computer science, statistics, and applied mathematics programs prefer the data science label but there is no distinction between (business) analytics and data science
Skills That Define a Data Scientist
Internet of Things (IoT)
- Internet of Things: Connecting physical world to the Internet, machine-to-machine
- Internet of People: Connecting humans to each other through technology, e.g., facebook, twitter
- Enablers of Internet of Things: sensors and sensing devices
- Examples: Self-driving cars, Fitness trackers, Smartbin ¨C trash detectors detecting fill levels, Smart refrigerators, and other appliances
- By 2020, another 38 billion things will be connected to the Internet
- Help build smart cities, smart cars, smart grid, smart anything
Building Blocks of IoT Technology Infrastructure
Internet of Things (IoT) Ecosystem
Cloud Computing and Business Analytics
- A style of pay-per-use computing in which dynamically scalable and often virtualized resources are provided over the Internet.
- Users need not have knowledge of, experience in, or control over the technology infrastructures in the cloud that supports them.
- Cloud computing = utility computing, application service provider grid computing, on-demand computing, software-as-a-service (SaaS), etc.
- Cloud computing service companies: Amazon Web Services (AWS), salesforce.com, Microsoft, IBM, Google, etc.
- Analytics as a Service in Cloud: Amazon Elastic Beanstalk, IBM Bluemix, Microsoft Azure, Google App Engine, Red Hat's OpenShift, Teradata - Aster Analytics as a Service, IBM Watson Analytics, MineMyText.com, SAS Visual Analytic and Visual Statistics, Tableau, Showflake, Predix by General Electric
Different Types of Cloud Offerings
Location-Based Analytics
Examples
- Loan Default Rate - A US State Heat Map (JMP Demo)
- Retailers - location + demographic details combined with other transactional data can help determine how sales vary by population level, assess locational proximity to other competitors and their offerings, assess the demand variations and efficiency of supply chain operations, analyze customer needs and complaints, better target different customer segments, etc.
- CabSense - finding a taxi in New York City, Rating of street corners, interactive maps, etc.
- The Case of the Dropped Mobile Calls