Automatic Gender Identification using Cell Phone Calling Behavior Presented by David.

Slides:



Advertisements
Similar presentations
Aspire Vertical Markets Banking, Finance and Insurance.
Advertisements

Enhancing Policy Decision Making with Large-Scale Digital Traces Vanessa Frias-Martinez University of Maryland NFAIS, February 2014.
An Overview of Machine Learning
Soft Skills for a Digital Workplace: Verbal Communication Unit C: Developing Professional Telephone Skills.
Investigations of Cell Phone Use While Driving in NC Jane Stutts William Hunter Herman Huang University of North Carolina Highway Safety Research Center.
Transportation mode detection using mobile phones and GIS information Leon Stenneth, Ouri Wolfson, Philip Yu, Bo Xu 1University of Illinois, Chicago.
Landmark Classification in Large- scale Image Collections Yunpeng Li David J. Crandall Daniel P. Huttenlocher ICCV 2009.
1 Toward More Sensitive Mobile Phones Ken Hinckley Eric Horvitz Attentional User Interfaces Project Microsoft Research.
Bonrix Advance Voice Call Manager Bonrix Software Systems Call us : , us at A-801, Samudra Complex,
Lecture 5 (Classification with Decision Trees)
Semi-supervised protein classification using cluster kernels Jason Weston, Christina Leslie, Eugene Ie, Dengyong Zhou, Andre Elisseeff and William Stafford.
Presentation of approach and pilot results Mannheim, March 20-22, 2015 You walk, you travel, you use your phone – differently!
Biometrics Kyle O'Meara April 14, Contents Introduction Specific Types of Biometrics Examples Personal Experience Questions.
FACILITATE OUTGOING PHONE CALLS
Ubiquitous Advertising: the Killer Application for the 21st Century Author: John Krumm Presenter: Anh P. Nguyen
Sales Force Automation and Automated Customer Service Centers
PAKDD'15 DATA MINING COMPETITION: GENDER PREDICTION BASED ON E-COMMERCE DATA Team members: Maria Brbic, Dragan Gamberger, Jan Kralj, Matej Mihelcic, Matija.
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
Modeling and Detecting Anomalous Topic Access Siddharth Gupta 1, Casey Hanson 2, Carl A Gunter 3, Mario Frank 4, David Liebovitz 4, Bradley Malin 6 1,2,3,4.
Auto Technologies Inc. Auto Technologies Call Cap Telemanagement System “ATTS”
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Identifying Feature Relevance Using a Random Forest Jeremy Rogers & Steve Gunn.
Mute Button 4 Speed Dial Buttons Follow Me/Change Modes Button Earphone Jack SIM Card Slot Conference Call Button Speaker ‘+’ Volume ‘-’ Volume PSTN Line.
Active Learning on Spatial Data Christine Körner Fraunhofer AIS, Uni Bonn.
Feature selection LING 572 Fei Xia Week 4: 1/29/08 1.
Semi-supervised Learning on Partially Labeled Imbalanced Data May 16, 2010 Jianjun Xie and Tao Xiong.
 To login to your Webportal go to and enter your User ID and Password. Call Forwarding Always Automatically.
2001/11/27IDS Lab Seminar1 Adaptive Fraud Detection Advisor: Dr. Hsu Graduate: Yung-Chu Lin Source: Fawcett, Tom and Foster Provost, Journal of Data Mining.
Problems in large-scale computer vision David Crandall School of Informatics and Computing Indiana University.
“A.T.T.S.” Never Miss A Call! Automated Tracking Telemanagement System.
Combining multiple learners Usman Roshan. Bagging Randomly sample training data Determine classifier C i on sampled data Goto step 1 and repeat m times.
Randomization in Privacy Preserving Data Mining Agrawal, R., and Srikant, R. Privacy-Preserving Data Mining, ACM SIGMOD’00 the following slides include.
MACHINE LEARNING 8. Clustering. Motivation Based on E ALPAYDIN 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2  Classification problem:
Supervised Clustering of Label Ranking Data Mihajlo Grbovic, Nemanja Djuric, Slobodan Vucetic {mihajlo.grbovic, nemanja.djuric,
COP5992 – DATA MINING TERM PROJECT RANDOM SUBSPACE METHOD + CO-TRAINING by SELIM KALAYCI.
Travel Data and the Smartphone: Building an International Travel Dataset One Android User at a Time GIL TAL MICHAEL NICHOLAS MATTHEW FAVETTI.
Apache Mahout Qiaodi Zhuang Xijing Zhang.
Speaker : Shau-Shiang Hung ( 洪紹祥 ) Adviser : Shu-Chen Cheng ( 鄭淑真 ) Date : 99/05/04 1 Qirui Zhang, Jinghua Tan, Huaying Zhou, Weiye Tao, Kejing He, "Machine.
CS378 Final Project The Netflix Data Set Class Project Ideas and Guidelines.
Lecture Notes for Chapter 4 Introduction to Data Mining
Date: 2011/1/11 Advisor: Dr. Koh. Jia-Ling Speaker: Lin, Yi-Jhen Mr. KNN: Soft Relevance for Multi-label Classification (CIKM’10) 1.
MX Automatic Call Distribution (ACD) v3.0 New Features.
Competition II: Springleaf Sha Li (Team leader) Xiaoyan Chong, Minglu Ma, Yue Wang CAMCOS Fall 2015 San Jose State University.
Locally Linear Support Vector Machines Ľubor Ladický Philip H.S. Torr.
Postacademic Interuniversity Course in Information Technology – Module C1p1 Chapter 5 Multiplexing.
BMTRY 789 Lecture 6: Proc Sort, Random Number Generators, and Do Loops Readings – Chapters 5 & 6 Lab Problem - Brain Teaser Homework Due – HW 2 Homework.
1 Automatic Cluster Detection Automatic Cluster Detection is useful to find “better behaved” clusters of data within a larger dataset; seeing the forest.
Musical Genre Categorization Using Support Vector Machines Shu Wang.
On Using SIFT Descriptors for Image Parameter Evaluation Authors: Patrick M. McInerney 1, Juan M. Banda 1, and Rafal A. Angryk 2 1 Montana State University,
6.S093 Visual Recognition through Machine Learning Competition Image by kirkh.deviantart.com Joseph Lim and Aditya Khosla Acknowledgment: Many slides from.
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
Communication Methods
Jajah Speaker: Yan-Shiang Wang Date: Classic Caller Callee PBX PSTN Signaling Voice.
Introduction Exploring Categorical Variables Exploring Numerical Variables Exploring Categorical/Numerical Variables Selecting Interesting Subsets of Data.
Jajah Speaker: Yan-Shiang Wang Date: Classic Caller Callee PBX PSTN Signaling Voice.
Date of download: 6/1/2016 Copyright © 2016 SPIE. All rights reserved. Diagrams of the (a) OC-SVM classifier and (b) MHOC-SVM classifier. Figure Legend:
Where Did You Go: Personalized Annotation of Mobility Records
Mobility diversity and socio-economic development
Artist Identification Based on Song Analysis
Data Analysis Lab 03 Adding Control Variables to Crosstabs
Source: Procedia Computer Science(2015)70:
Introducing EasyVoIZ 5 Version 5.
Dieudo Mulamba November 2017
Using Transductive SVMs for Object Classification in Images
A survey of network anomaly detection techniques
جامعة الملك سعود – المملكة العربية السعودية
TIME LINE.
Title: _____________________
University of Washington, Autumn 2018
Presentation transcript:

Automatic Gender Identification using Cell Phone Calling Behavior Presented by David

Motivation Existing gender classification – Based on voice – Based on image – Violate user privacy Purpose of this work – understanding how phone usage associated with gender id – evaluating common approaches for gender id

Call Data Records Dataset – 2 M calls – 10 K phone numbers Features – encrypted cell phone numbers of caller and callee – the date and time of the call – the duration of the call – The initial and final location of the caller while making the call

Variables Behavioral Variables – Number of Calls – Average Duration of Calls – Expenses Social Variables – In Degree – Out Degree – Degree Mobility Variables – Talk Distance – Route Distance

Behavior vs. Gender Ranked distributions charts

Behavior vs. Gender (Cont.) Ranked distributions charts Selected features – number of incoming calls – Number of outgoing calls – average duration of incoming calls – average duration of outgoing calls – Expenses – Degree

Gender Classification SVM Random Forests

Gender Classification (Cont.) Semi-supervised (K-means + Labeling + KNN)