A Survey of AUC as a Machine Learning Evaluation Metric

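In Internet ranking businesses such as search, recommendation, and advertising, AUC (Area Under the ROC Curve) is a very common evaluation metric. There is plenty of material on AUC online, and Zhihu hosts quite a few excellent discussions. This article attempts a survey based on my own understanding of AUC. My knowledge is limited, so corrections are welcome.
As the saying goes, asking the right question is half the battle. This article starts with the following questions, and I hope that by the end you will understand them better.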

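How many ways are there to interpret AUC?
What property of AUC makes it so popular?
What does the value of AUC depend on, and how high is high?
Does a higher AUC mean that online metrics will improve?
Is there a better metric to replace AUC?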

Several interpretations of AUC

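There are generally two families of interpretation. One is based on the area under the ROC curve, which requires understanding the confusion matrix and the meaning of metrics such as precision, recall, F1, and ROC. The other is probabilistic: AUC measures the model's ranking ability.
References [1] and [4] discuss the definition of AUC in great detail and cover both families in various forms. They also cover how to optimize AUC directly as an objective and the various ways to compute it. I will not repeat that here; interested readers can follow the links.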

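The ranking property of AUC
Compared with metrics such as accuracy and precision, AUC does not depend on the absolute values of the model's predicted scores. It only looks at how well the samples are ordered, which makes it especially suitable for ranking businesses.
Why is independence from the score values such a good property? Suppose you use precision or F1 and the model outputs a probability. You must then pick a threshold that decides which samples are predicted as 1 and which as 0, and different thresholds give different precision values. AUC uses the scores directly and depends only on their relative order, so it is easier to work with.
Compared with the area-under-the-ROC-curve interpretation, I personally prefer the ranking interpretation. The explanation in reference [2] is easy to follow:
Take an AUC of 0.7 as an example. It can be read roughly as: given one randomly chosen positive sample and one randomly chosen negative sample, the model scores the positive higher than the negative 70% of the time. Under this interpretation, all we care about is which of the two scores is higher; the actual values are irrelevant.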
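The pairwise reading above can be checked with a few lines of code. The following is a minimal sketch in plain Python, not a production implementation. The function names `pairwise_auc` and `precision_at` and the toy labels and scores are illustrative assumptions, not something taken from the references.

```python
import math


def pairwise_auc(labels, scores):
    """Share of (positive, negative) pairs where the positive scores higher.

    Ties count as half a win. Assumes both classes are present.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def precision_at(labels, scores, threshold):
    """Precision when every sample with score >= threshold is predicted as 1."""
    predicted = [y for y, s in zip(labels, scores) if s >= threshold]
    return sum(predicted) / len(predicted) if predicted else 0.0


# Toy data (hypothetical): 4 positives and 4 negatives.
labels = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]

# 11 of the 16 positive/negative pairs are ordered correctly.
auc = pairwise_auc(labels, scores)

# A strictly increasing transform changes every score but not their order,
# so the AUC is unchanged.
auc_after_log = pairwise_auc(labels, [math.log(s) for s in scores])

# Precision, by contrast, moves with the chosen threshold.
precision_by_threshold = {t: precision_at(labels, scores, t) for t in (0.25, 0.5, 0.75)}
```

On this toy data the AUC is 11/16 = 0.6875 before and after the log transform, while precision is 0.5, 0.75, and 0.5 at thresholds 0.25, 0.5, and 0.75.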

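AUC is insensitive to uniform sampling of positive and negative samples

Because AUC depends only on the relative order of positive and negative samples and not on the score values themselves, the usual random sampling of positives and negatives does not change it. In click-through rate prediction, for example, negatives are sometimes downsampled to save computing resources, yet AUC stays essentially the same because the sampling does not affect how positives and negatives are ordered relative to each other.
That is, assume the sampling is random. Given a positive sample that the model scores as score1, the ratio of negatives scored above score1 to negatives scored below score1 does not change in expectation after sampling, precisely because the sampling is random.
If the sampling is not uniform, however, AUC can change dramatically. One example is negative sampling in the style of word2vec, which draws negatives preferentially from popular items.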
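A small simulation can make the contrast between uniform and biased negative sampling concrete. This is only a sketch under stated assumptions: scores are drawn from two Gaussians, and the biased sampler assumes that popular items tend to receive higher scores, using `exp(score)` as a hypothetical stand-in for popularity. None of these choices come from the original discussion.

```python
import bisect
import math
import random


def auc(pos_scores, neg_scores):
    """Share of (positive, negative) pairs ranked correctly; ties count as half."""
    neg_sorted = sorted(neg_scores)
    wins = 0.0
    for p in pos_scores:
        below = bisect.bisect_left(neg_sorted, p)
        ties = bisect.bisect_right(neg_sorted, p) - below
        wins += below + 0.5 * ties
    return wins / (len(pos_scores) * len(neg_sorted))


rng = random.Random(0)  # fixed seed so the run is repeatable

# Assumed score distributions: positives score higher on average.
pos = [rng.gauss(1.0, 1.0) for _ in range(2000)]
neg = [rng.gauss(0.0, 1.0) for _ in range(20000)]

auc_full = auc(pos, neg)

# Uniform downsampling: keep a random 10% of the negatives.
auc_uniform = auc(pos, rng.sample(neg, 2000))

# Biased sampling: negatives with higher scores are drawn more often
# (hypothetical proxy for "popular items are sampled more").
weights = [math.exp(s) for s in neg]
auc_biased = auc(pos, rng.choices(neg, weights=weights, k=2000))
```

In this setup `auc_uniform` should stay close to `auc_full`, while `auc_biased` should fall noticeably, because the biased sampler keeps exactly the negatives that are hardest to separate from the positives.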

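What the value of AUC means
In real business settings we often find that a click-through rate model has a lower AUC than a purchase conversion rate model. As noted earlier, AUC reflects how the model orders samples relative to each other: the more cleanly the predictions separate positives from negatives, the higher the AUC.
A click usually costs the user less than a purchase. From a business point of view, positives and negatives differ less in a click-through rate model than in a purchase conversion model, so the positives of a purchase conversion model are usually easier to predict correctly.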

Attentive readers may wonder: if the value of AUC depends on the business data itself, what value counts as good?

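The theoretical upper bound of AUC
Suppose we had an extremely powerful model that could predict the probability of every sample exactly. Would its AUC be 1? Reality is often harsh: the data itself contains many ambiguous samples, that is, samples whose feature sets are identical but whose labels differ. So even such a powerful model cannot reach an AUC of 1.
Therefore, when we first get the sample data, the first step should be to check how many samples share the same features but carry different labels. The larger this share, the more "unavoidable mistakes" there are. In the literature this is called the Bayes Error Rate (BER), and it can also be understood as the part of the error that cannot be optimized away.
Much of the effort we put into feature engineering goes toward alleviating this problem. When adding a feature, checking whether it reduces the BER of the samples can serve as a reference signal for feature construction.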
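The check described above can be sketched as follows. The idea is an assumption of this example rather than a method from the references: for a model that sees only the given features, the best achievable ranking on a dataset scores each distinct feature combination by its empirical positive rate, so the AUC of that scoring gives an upper bound on that dataset. The names `auc_upper_bound` and `conflicting_share` and the toy samples are hypothetical.

```python
from collections import defaultdict


def pairwise_auc(labels, scores):
    """Share of (positive, negative) pairs ranked correctly; ties count as half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def group_counts(samples):
    """Map each feature tuple to [number of negatives, number of positives]."""
    counts = defaultdict(lambda: [0, 0])
    for features, label in samples:
        counts[features][label] += 1
    return counts


def conflicting_share(samples):
    """Share of samples whose exact feature tuple appears with both labels."""
    counts = group_counts(samples)
    mixed = sum(n + p for n, p in counts.values() if n > 0 and p > 0)
    return mixed / len(samples)


def auc_upper_bound(samples):
    """AUC obtained by scoring every sample with its group's positive rate."""
    counts = group_counts(samples)
    rate = {f: p / (n + p) for f, (n, p) in counts.items()}
    labels = [label for _, label in samples]
    scores = [rate[f] for f, _ in samples]
    return pairwise_auc(labels, scores)


# Toy samples (hypothetical): one categorical feature, labels in {0, 1}.
base = [(("a",), 1), (("a",), 1), (("a",), 0),
        (("b",), 1), (("b",), 0), (("b",), 0),
        (("c",), 0), (("c",), 0)]

# Same samples with an extra feature that happens to resolve every conflict.
richer = [(("a", "u"), 1), (("a", "u"), 1), (("a", "v"), 0),
          (("b", "u"), 1), (("b", "v"), 0), (("b", "v"), 0),
          (("c", "v"), 0), (("c", "v"), 0)]

share_before, bound_before = conflicting_share(base), auc_upper_bound(base)
share_after, bound_after = conflicting_share(richer), auc_upper_bound(richer)
```

On the toy data, 6 of 8 samples sit in conflicting groups and the bound is 0.8 before the extra feature; afterwards no conflicts remain and the bound rises to 1.0. On real data with high-cardinality features almost every sample is unique, so the check is more informative on coarse or bucketed feature sets.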

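The big-picture relationship between AUC and online business metrics
AUC is, after all, an offline evaluation metric, and it differs from the real online business metrics. The smaller the difference, the more useful AUC is as a reference. Take the click-through rate and purchase conversion rate models mentioned above: although the purchase conversion model has the higher AUC, the click-through rate model is usually the easier one to get right and performs better online.
A purchase decision takes longer and costs more than a click decision, and it is influenced by many outside factors: the budget is too tight, a cheaper offer turned up on another platform, a review on Zhihu was unfavorable, and so on. This information cannot be collected, so the samples end up missing a lot of information and the gap between the model's offline AUC and the online business metrics widens.
In short, the closer the information in the sample data is to what happens online, the smaller the gap between offline and online metrics. The longer the decision chain, the more information is lost and the harder it is to keep offline and online results consistent.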

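When AUC gains and business metrics disagree
Fortunately, in practice we usually compare AUC across model iterations: a new model with a higher AUC than the old one ranks positive and negative samples better. In theory, launching an A/B test at this point should show growth in online metrics such as CTR.
In practice the two often disagree. First, we need to rule out some basic mistakes:
1. Rule out bugs: the model's predictions online and offline should match expectations.
2. Guard against data leakage across time. For example, if the samples contain time-series features but the train/test split ignores time, information from the future can easily leak into training.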
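For the second item, one simple safeguard is to split by timestamp instead of at random. The sketch below is illustrative only: the row layout, the field name `ts`, and the helper `split_by_time` are assumptions made for this example.

```python
def split_by_time(rows, cutoff, time_key="ts"):
    """Rows strictly before `cutoff` go to train, the rest to test.

    This guarantees that no training row is newer than any test row.
    """
    train = [r for r in rows if r[time_key] < cutoff]
    test = [r for r in rows if r[time_key] >= cutoff]
    return train, test


# Hypothetical impression log; `ts` could be a day index or a Unix timestamp.
rows = [
    {"ts": 5, "user": "u1", "label": 1},
    {"ts": 1, "user": "u2", "label": 0},
    {"ts": 9, "user": "u1", "label": 0},
    {"ts": 3, "user": "u3", "label": 1},
    {"ts": 7, "user": "u2", "label": 1},
]

train, test = split_by_time(rows, cutoff=6)
```

The split alone is not sufficient: any aggregated feature, such as a historical click count, also has to be computed only from events that happened before the row's own timestamp.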

For more details, see references [3] and [5].

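Shortcomings of the AUC computation and improvements
AUC is computed from the model's ability to rank the full set of samples, whereas a real online scenario usually only cares about the ranking within one session of one user. This gap often causes problems, and the cases given in reference [3] are fairly typical. There are two main points:
1. New samples that were never seen offline appear online, which makes the offline AUC an insufficient guide. This is mostly mitigated with online learning; there is little to improve in AUC itself.
2. Online ranking happens within one user's session, while offline AUC is computed over the full set. Ranking a positive sample clicked by user1 above a negative sample not clicked by user2 has no practical meaning, yet the offline AUC computation counts it.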

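In the paper Deep Interest Network for Click-Through Rate Prediction, Alibaba proposed group AUC (GAUC) to deal with the problem above. The formula is:

$$\mathrm{GAUC} = \frac{\sum_{i=1}^{n} \#\mathrm{impression}_i \times \mathrm{AUC}_i}{\sum_{i=1}^{n} \#\mathrm{impression}_i}$$

Here n is the number of users and AUC_i is the AUC computed over the impressions of user i. That is, users are the groups, and the per-user AUC values are averaged with each user's number of impressions as the weight. In my view, grouping by user alone is not enough; the grouping should be by session.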
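The impression-weighted average described above can be sketched in a few lines. This is not the paper's code. The function name `group_auc`, the `(group_id, label, score)` row layout, and the choice to skip groups that contain only one class (where AUC is undefined) are assumptions of this example.

```python
from collections import defaultdict


def pairwise_auc(labels, scores):
    """Share of (positive, negative) pairs ranked correctly; ties count as half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def group_auc(rows):
    """Impression-weighted mean of per-group AUC.

    rows: iterable of (group_id, label, score).
    Groups with only positives or only negatives are skipped (assumption).
    """
    groups = defaultdict(list)
    for group_id, label, score in rows:
        groups[group_id].append((label, score))

    weighted_sum = 0.0
    total_weight = 0
    for items in groups.values():
        labels = [y for y, _ in items]
        if 0 < sum(labels) < len(labels):  # both classes present
            scores = [s for _, s in items]
            weighted_sum += len(items) * pairwise_auc(labels, scores)
            total_weight += len(items)
    return weighted_sum / total_weight if total_weight else float("nan")


# Toy log (hypothetical): user1 gets high scores overall, user2 low scores,
# but within each user the clicked item is ranked above the unclicked one.
rows = [
    ("user1", 1, 0.9), ("user1", 0, 0.8),
    ("user2", 1, 0.3), ("user2", 0, 0.2),
]

gauc = group_auc(rows)
global_auc = pairwise_auc([y for _, y, _ in rows], [s for _, _, s in rows])
```

Here the global AUC is 0.75 because user2's clicked item is scored below user1's unclicked item, a comparison that never happens online, while the grouped value is 1.0. To group by session instead of by user, pass a session identifier, or a (user, session) tuple, as `group_id`.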

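Finally, these caveats about AUC only matter once the model has been optimized to a certain level. In most cases, if the model's AUC improves substantially, the online results will generally agree. If they do not, first check whether the product design itself hides a pitfall.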

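References:
1. How should AUC be understood in machine learning and statistics?
https://www.zhihu.com/question/39840928

2. How high an AUC counts as high?
https://zhuanlan.zhihu.com/p/24217322

3. Why doesn't an offline AUC gain bring an online improvement? Some truths about testing and evaluation
https://zhuanlan.zhihu.com/p/35459467

4. What are the respective strengths and weaknesses of precision, recall, F1, ROC, and AUC?
https://www.zhihu.com/question/30643044

5. How to resolve the mismatch between offline AUC and online click-through rate?
https://www.zhihu.com/question/305823078/answer/552640544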

Original article: https://zhuanlan.zhihu.com/p/52930683
