
Improving predictive power of Binary Response model using Multi Step Logistic Approach

White Paper Published By: Genpact

This white paper discusses the Multi Step Logistic Model, which is a non-optimization technique but may be one of the most useful techniques when the cost of maximization/minimization is high, i.e. when the proportions of 1 and 0 are distinctly different and the goal (tagging actual 1 as 1 and actual 0 as 0) can be improved by only a few percent at the cost of heavy misclassification of the other class.



Tags : 
binary response model, multi step logistic approach, genpact, software development, c++, database development, java, visual basic

Published:  Nov 20, 2008
Type:  White Paper
Length:  13 pages

IMPROVING PREDICTIVE POWER OF BINARY
RESPONSE MODEL USING MULTI STEP LOGISTIC
APPROACH

Sandeep Das *
Senior Consultant, Analytics
Genpact India, DLF IT Park, Tower 1, 7th & 8th Floor,
8 Major Arterial Road, New Town Rajarhat, Kolkata - 700156, India

April 2009

* Corresponding author: Tel: +91-9836268676 E-mail address: sandeep.das1@genpact.com







ABSTRACT:

This paper discusses a methodology called "Multi Step Logistic Regression" to improve the
predictive power of the binary logistic regression model in terms of a higher Hit/Miss ratio. A
'Hit' is defined as a correct classification (tagging) and a 'Miss' as a wrong classification,
both obtained from the cross tabulation of actual vs. predicted tagging. In this approach, after
choosing the final logistic model, the model building population is segregated into two
parts, predicted 1 and predicted 0, by selecting a cutoff on the predicted probability distribution.
For the predicted 1 group, the parameter estimates are re-estimated keeping the same variables
that were significant in the initial model. The user may choose to introduce new variables in each
iteration and keep them in the model according to their significance. These steps are repeated
iteratively until a cost-benefit assessment justifies stopping. The conventional (single step)
logistic method does not help to tackle a situation where the proportions of 1 and 0 are distinctly
different or the cost of misallocation is high. To tackle such a situation, we discuss this
alternative approach. This paper aims to improve the concentration in the Hit cells with a
tolerable, regulated increase (rather than an alarming one) in the concentration of
misclassification compared to the single step approach.
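
The iteration described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the author's implementation: the helper names (`fit_logistic`, `predict_prob`, `multi_step_logistic`), the gradient-ascent fit, the fixed `max_steps` stopping rule (standing in for the cost-benefit criterion) and the synthetic data are all hypothetical. It also assumes that an observation is finally tagged 1 only if it stays at or above the cutoff in every step.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict_prob(w, x):
    # w[0] is the intercept, w[1:] are the coefficients of x.
    return sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], x)))


def fit_logistic(X, y, lr=0.1, epochs=500):
    # Plain batch gradient ascent on the log-likelihood (illustrative only).
    n, k = len(X), len(X[0])
    w = [0.0] * (k + 1)
    for _ in range(epochs):
        grad = [0.0] * (k + 1)
        for xi, yi in zip(X, y):
            err = yi - predict_prob(w, xi)
            grad[0] += err
            for j in range(k):
                grad[j + 1] += err * xi[j]
        w = [wj + lr * gj / n for wj, gj in zip(w, grad)]
    return w


def multi_step_logistic(X, y, cutoff=0.5, max_steps=3):
    # Assumption: max_steps replaces the paper's cost-benefit stopping rule.
    if len(set(y)) < 2:
        raise ValueError("need both classes to fit a logistic model")
    active = list(range(len(X)))  # observations still in the predicted 1 group
    models = []
    for _ in range(max_steps):
        ys = [y[i] for i in active]
        if len(set(ys)) < 2:  # cannot re-estimate on a single class
            break
        # Re-estimate the parameters on the current group, same variables.
        w = fit_logistic([X[i] for i in active], ys)
        models.append(w)
        survivors = [i for i in active if predict_prob(w, X[i]) >= cutoff]
        unchanged = len(survivors) == len(active)
        active = survivors
        if unchanged or not active:
            break
    tags = [0] * len(X)
    for i in active:
        tags[i] = 1
    return models, tags


# Synthetic data, purely for illustration.
random.seed(0)
X = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(200)]
y = [1 if random.random() < sigmoid(-1.5 + 1.5 * x[0] + x[1]) else 0 for x in X]

_, single_tags = multi_step_logistic(X, y, cutoff=0.3, max_steps=1)
models, multi_tags = multi_step_logistic(X, y, cutoff=0.3, max_steps=3)
print("single step predicted 1:", sum(single_tags))
print("multi step predicted 1:", sum(multi_tags), "after", len(models), "fits")
```

With this tagging rule, the multi step predicted 1 group is always a subset of the single step predicted 1 group, so each extra step can only move observations from predicted 1 to predicted 0. Whether that migration improves the Hit/Miss ratio depends on the data and on the cutoff.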

KEY WORDS: Multistep Logistic, Binary Response Model, Improving predictive power of
Probability of Default (PD) model.

PAPER TYPE: Data Analysis in Banking and Financial Services











1. INTRODUCTION:

In the standard binary logistic model building exercise, we first finalize the choice of a model.
We then choose a cutoff on the predicted probability distribution to define Predicted 1 and
Predicted 0. But we have no control over the Hit/Miss ratio associated with the choice of cutoff,
because we cannot regulate the migration of cell elements between the different Hit and Miss cells
by the choice of cutoff alone. In this paper, we discuss the Multi Step Logistic approach as a
possible way to improve the Hit/Miss ratio of the model. The industry verticals where this could
be applicable include Risk, Credit Risk, Bankruptcy forecasting etc. This methodology can also be
used in other domains where we want to predict dichotomous response variables such as Yes/No or
Good/Bad and where one of the categories has a very high or very low share of the population.
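
To make the Hit and Miss cells concrete, the following minimal Python sketch cross-tabulates actual vs. predicted tagging at a given cutoff. The function names, the cell labels and the toy data are assumptions made for illustration; they do not come from the paper.

```python
def hit_miss_table(actual, predicted_prob, cutoff):
    # Cross-tabulate actual vs. predicted tags at a probability cutoff.
    cells = {"hit_1": 0, "hit_0": 0, "miss_1_as_0": 0, "miss_0_as_1": 0}
    for y, p in zip(actual, predicted_prob):
        predicted = 1 if p >= cutoff else 0
        if y == 1 and predicted == 1:
            cells["hit_1"] += 1
        elif y == 0 and predicted == 0:
            cells["hit_0"] += 1
        elif y == 1:
            cells["miss_1_as_0"] += 1
        else:
            cells["miss_0_as_1"] += 1
    return cells


def hit_miss_ratio(cells):
    # Hits are the two diagonal cells, misses the two off-diagonal cells.
    hits = cells["hit_1"] + cells["hit_0"]
    misses = cells["miss_1_as_0"] + cells["miss_0_as_1"]
    return float("inf") if misses == 0 else hits / misses


# Toy data (hypothetical): two actual 1s among ten observations.
actual = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
prob = [0.9, 0.4, 0.6, 0.5, 0.38, 0.2, 0.2, 0.1, 0.1, 0.05]

for cutoff in (0.5, 0.35):
    cells = hit_miss_table(actual, prob, cutoff)
    print(cutoff, cells, hit_miss_ratio(cells))
```

In this toy data, lowering the cutoff from 0.5 to 0.35 recovers one actual 1 but pushes one actual 0 into the wrong cell, so elements migrate between cells while the overall Hit/Miss ratio stays at 7/3. That is the limitation of cutoff choice alone that the paragraph above describes.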

2. BACKGROUND:

Let's take a standard logistic scorecard building scenario where we are modeling for Bad; in
the end, this essentially generates a predicted probability distribution of Bad. In many business
scenarios, we find that the proportions of Bad and Good are distinctly different. For example, if
we build a response model for a mailing campaign, we expect very few actual responses to be
available. In such a situation, the stability of the logistic model is itself
questionable. Here, we either increase the number of responses by biased sampling or use
specific algorithms such as 'zero inflated models', 'modeling rare events' or sometimes 'neural
network' logic to improve the model's predictive power. These algorithms are complex and not
user friendly to implement. In this paper, we discuss a methodology which is
suitable when we want to increase the Hit/Miss ratio compared to that of Single Step logistic
regression. A Single Step logistic approach is the standard ...
