Article(id=1190284124595261917, tenantId=1146029695717560320, journalId=1189918454225211397, issueId=1190290597995970659, articleNumber=null, orderNo=null, doi=10.20104/j.cnki.1674-6546.20240304, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=research-article, receivedDate=null, receivedDateStr=null, revisedDate=1720108800000, revisedDateStr=2024-07-05, acceptedDate=null, acceptedDateStr=null, onlineDate=1761715449713, onlineDateStr=2025-10-29, pubDate=1755187200000, pubDateStr=2025-08-15, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1761716993551, onlineIssueDateStr=2025-10-29, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=1761715449713, onlineFirstDateStr=2025-10-29, sourceXml=null, magXml=null, createTime=1761715449713, creator=13701087609, updateTime=1761715449713, updator=13701087609, issue=Issue{id=1190290597995970659, tenantId=1146029695717560320, journalId=1189918454225211397, year='2025', volume='', issue='8', pageStart='1', pageEnd='48', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=1, specialIssue=null, createTime=1761716993092, creator=13701087609, updateTime=1761717119370, updator=13701087609, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1190291127711404379, tenantId=1146029695717560320, journalId=1189918454225211397, issueId=1190290597995970659, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1190291127711404380, tenantId=1146029695717560320, journalId=1189918454225211397, issueId=1190290597995970659, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=29, endPage=36, ext={EN=ArticleExt(id=1190284124758839775, articleId=1190284124595261917, tenantId=1146029695717560320, journalId=1189918454225211397, language=EN, title=Driver Abnormal Driving Behavior Detection Algorithm Based on Contrast Learning, columnId=null, journalTitle=Automotive Engineer, columnName=null, runingTitle=null, highlight=null, articleAbstract=

In the process of driving a vehicle, the complex and changing environment inside the vehicle, the change of lighting conditions and the diversity of drivers’ behavioral postures affect the detection and recognition of abnormal driver behavior. To address this issue, this paper proposes a driver abnormal driving behavior detection algorithm based on contrast learning. The paper firstly considers driver’s driving behavior detection as a binary classification task, and utilizes a contrast learning approach to compare driver’s normal driving with abnormal driving samples and to improve the performance of the model by contrasting loss functions. Secondly, the depth images right ahead and above the driver serves as inputs to solve the problems of complex in-vehicle environment to change the light intensity and blind spots in viewpoint by providing the depth information of the driver. Finally, 3D convolution is introduced in the lightweight network MobileNetV2, and the operation of channel blending is added to the convolution layer of each bottleneck structure to improve the accuracy of recognition. Test results show that accuracy of the proposed algorithm reaches 94.18% in the Driver’s Abnormality Detection (DAD) dataset and ROC AUC reaches 0.962, which shows the effectiveness of the algorithm in driver’s abnormal behavior detection.

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Zhonglun Li, Guangda Yu, Shuai Yang, Shiye Zou, Hequn Zhang, Chunyu Wang), CN=ArticleExt(id=1190284398927909848, articleId=1190284124595261917, tenantId=1146029695717560320, journalId=1189918454225211397, language=CN, title=基于对比学习的驾驶员异常驾驶行为检测算法*, columnId=1190284124830142945, journalTitle=汽车工程师, columnName=智能车辆环境感知与智能决策技术专刊, runingTitle=null, highlight=null, articleAbstract=

针对车辆行驶过程中复杂多变的车内环境和光照条件及驾驶员行为姿态多样性对驾驶员异常行为检测与识别的影响,提出一种基于对比学习的驾驶员异常驾驶行为检测算法。首先,将驾驶员驾驶行为检测视为二分类任务,使用对比学习的方法,将驾驶员正常驾驶与异常驾驶样本进行对比,并通过对比损失函数提高模型的性能。其次,以驾驶员正前方和上方的深度图像作为输入,通过提供驾驶员的深度信息解决车内环境复杂、光照强度变化及视角盲区问题。最后,在轻量化网络MobileNetV2基础上引入3D卷积,并在每个瓶颈结构的卷积层中加入通道混洗操作,提高识别的准确性。试验结果表明,提出的算法在驾驶员异常检测(DAD)数据集测试集中的准确率达到94.18%,受试者操作特征(ROC)曲线下面积(AUC)达到0.962,该算法在驾驶员异常行为检测方面具有有效性。

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=xKNB+wkAvkhDqkVJumZulA==, magXml=PcUD3v+Sb4HbLQO8dzC1Vg==, pdfUrl=null, pdf=GU/fRMQu+A3iMi7+P49xKA==, pdfFileSize=7931848, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=EghHTXsQQvLn+D3qhnAOXQ==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=aH0AqCaj8cEZlOmKCvEV6g==, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=李仲伦, 于光达, 杨帅, 邹世野, 张鹤群, 汪春雨)}, authors=[Author(id=1194611012092534813, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1194611012168032287, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611012092534813, language=EN, stringName=Zhonglun Li, firstName=Zhonglun, middleName=null, lastName=Li, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1194611012256112672, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611012092534813, language=CN, stringName=李仲伦, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=吉林省产品质量监督检验院, 长春 130013, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1194611012021231641, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, xref=null, ext=[AuthorCompanyExt(id=1194611012029620250, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103), AuthorCompanyExt(id=1194611012033814555, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=吉林省产品质量监督检验院, 长春 130013)])]), Author(id=1194611012319027234, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1194611012394524708, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611012319027234, language=EN, stringName=Guangda Yu, firstName=Guangda, middleName=null, lastName=Yu, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1194611012457439269, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611012319027234, language=CN, stringName=于光达, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=吉林省产品质量监督检验院, 长春 130013, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1194611012021231641, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, xref=null, ext=[AuthorCompanyExt(id=1194611012029620250, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103), AuthorCompanyExt(id=1194611012033814555, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=吉林省产品质量监督检验院, 长春 130013)])]), Author(id=1194611012520353831, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1194611012604239913, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611012520353831, language=EN, stringName=Shuai Yang, firstName=Shuai, middleName=null, lastName=Yang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1194611012658765866, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611012520353831, language=CN, stringName=杨帅, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=吉林省产品质量监督检验院, 长春 130013, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1194611012021231641, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, xref=null, ext=[AuthorCompanyExt(id=1194611012029620250, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103), AuthorCompanyExt(id=1194611012033814555, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=吉林省产品质量监督检验院, 长春 130013)])]), Author(id=1194611012717486124, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1194611012784594990, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611012717486124, language=EN, stringName=Shiye Zou, firstName=Shiye, middleName=null, lastName=Zou, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1194611012851703855, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611012717486124, language=CN, stringName=邹世野, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=吉林省产品质量监督检验院, 长春 130013, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1194611012021231641, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, xref=null, ext=[AuthorCompanyExt(id=1194611012029620250, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103), AuthorCompanyExt(id=1194611012033814555, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=吉林省产品质量监督检验院, 长春 130013)])]), Author(id=1194611012910424113, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, orderNo=4, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1194611012981727283, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611012910424113, language=EN, stringName=Hequn Zhang, firstName=Hequn, middleName=null, lastName=Zhang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1194611013040447540, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611012910424113, language=CN, stringName=张鹤群, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=吉林省产品质量监督检验院, 长春 130013, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1194611012021231641, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, xref=null, ext=[AuthorCompanyExt(id=1194611012029620250, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103), AuthorCompanyExt(id=1194611012033814555, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=吉林省产品质量监督检验院, 长春 130013)])]), Author(id=1194611013099167798, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, orderNo=5, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1194611013166276664, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611013099167798, language=EN, stringName=Chunyu Wang, firstName=Chunyu, middleName=null, lastName=Wang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1194611013220802617, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, authorId=1194611013099167798, language=CN, stringName=汪春雨, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=吉林省产品质量监督检验院, 长春 130013, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1194611012021231641, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, xref=null, ext=[AuthorCompanyExt(id=1194611012029620250, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103), AuthorCompanyExt(id=1194611012033814555, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=吉林省产品质量监督检验院, 长春 130013)])])], keywords=[Keyword(id=1194611013346631738, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, orderNo=1, keyword=Abnormal driving behavior detection), Keyword(id=1194611013409546299, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, orderNo=2, keyword=Contrast learning), Keyword(id=1194611013476655164, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, orderNo=3, keyword=Second classification), Keyword(id=1194611013573124157, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, orderNo=4, keyword=3D Convolution Neural Networks (CNN)), Keyword(id=1194611013631844414, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, orderNo=1, keyword=异常驾驶行为检测), Keyword(id=1194611013686370367, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, orderNo=2, keyword=对比学习), Keyword(id=1194611013757673536, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, orderNo=3, keyword=二分类), Keyword(id=1194611013816393793, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, orderNo=4, keyword=3D卷积神经网络)], refs=[Reference(id=1194611016865652832, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2016, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=YANG Y B, ZHOU L J, LENG M J, journalName=2016 International Conference on Intelligent Transportation, Big Data & Smart City (ICITBS), refType=null, unstructuredReference=YANG Y B, ZHOU L J, LENG M J, et al. Early Warning of Traffic Accident in Shanghai Based on Large Data Set Mining[C]// 2016 International Conference on Intelligent Transportation, Big Data & Smart City (ICITBS). Changsha, China: IEEE, 2016., articleTitle=Early Warning of Traffic Accident in Shanghai Based on Large Data Set Mining, refAbstract=null), Reference(id=1194611016924373089, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2016, volume=113, issue=10, pageStart=2636, pageEnd=2641, url=null, language=null, rfNumber=[2], rfOrder=1, authorNames=DINGUS T A, GUO F, LEE S, journalName=Proceedings of the National Academy of Sciences of the United States of America, refType=null, unstructuredReference=DINGUS T A, GUO F, LEE S, et al. Driver Crash Risk Factors and Prevalence Evaluation Using Naturalistic Driving Data[J]. Proceedings of the National Academy of Sciences of the United States of America, 2016, 113(10): 2636-2641., articleTitle=Driver Crash Risk Factors and Prevalence Evaluation Using Naturalistic Driving Data, refAbstract=null), Reference(id=1194611016983093346, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2020, volume=20, issue=14, pageStart=5702, pageEnd=5708, url=null, language=null, rfNumber=[3], rfOrder=2, authorNames=陈军, 张黎, 周博, journalName=科学技术与工程, refType=null, unstructuredReference=陈军, 张黎, 周博, 等. 基于级联卷积神经网络的驾驶员分心驾驶行为检测[J]. 科学技术与工程, 2020, 20(14): 5702-5708., articleTitle=基于级联卷积神经网络的驾驶员分心驾驶行为检测, refAbstract=null), Reference(id=1194611017046007907, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2020, volume=20, issue=14, pageStart=5702, pageEnd=5708, url=null, language=null, rfNumber=[3], rfOrder=3, authorNames=CHEN J, ZHANG L, ZHOU B, journalName=Science Technology and Engineering, refType=null, unstructuredReference=CHEN J, ZHANG L, ZHOU B, et al. Driver Distracted Driving Behavior Detection Based on Cascaded Convolutional Neural Networks[J]. Science Technology and Engineering, 2020, 20(14): 5702-5708., articleTitle=Driver Distracted Driving Behavior Detection Based on Cascaded Convolutional Neural Networks, refAbstract=null), Reference(id=1194611017104728164, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2014, volume=null, issue=null, pageStart=344, pageEnd=349, url=null, language=null, rfNumber=[4], rfOrder=4, authorNames=TAWARI A, TRIVEDI M M, journalName=2014 IEEE Intelligent Vehicles Symposium Proceedings, refType=null, unstructuredReference=TAWARI A, TRIVEDI M M. Robust and Continuous Estimation of Driver Gaze Zone by Dynamic Analysis of Multiple Face Videos[C]// 2014 IEEE Intelligent Vehicles Symposium Proceedings. Dearborn, Michigan, USA: IEEE, 2014: 344-349., articleTitle=Robust and Continuous Estimation of Driver Gaze Zone by Dynamic Analysis of Multiple Face Videos, refAbstract=null), Reference(id=1194611017167642725, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2022, volume=58, issue=1, pageStart=115, pageEnd=134, url=null, language=null, rfNumber=[5], rfOrder=5, authorNames=高菲, 杨柳, 李晖, journalName=南京大学学报(自然科学), refType=null, unstructuredReference=高菲, 杨柳, 李晖. 开放集识别研究综述[J]. 南京大学学报(自然科学), 2022, 58(1): 115-134., articleTitle=开放集识别研究综述, refAbstract=null), Reference(id=1194611017226362982, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2022, volume=58, issue=1, pageStart=115, pageEnd=134, url=null, language=null, rfNumber=[5], rfOrder=6, authorNames=GAO F, YANG L, LI H, journalName=Journal of Nanjing University (Natural Science), refType=null, unstructuredReference=GAO F, YANG L, LI H. A Review of Open Set Identification Research[J]. Journal of Nanjing University (Natural Science), 2022, 58(1): 115-134., articleTitle=A Review of Open Set Identification Research, refAbstract=null), Reference(id=1194611017289277543, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2023, volume=19, issue=3, pageStart=2574, pageEnd=2584, url=null, language=null, rfNumber=[6], rfOrder=7, authorNames=CHEN J M, WANG G J, LÜ J C, journalName=IEEE Transactions on Industrial Informatics, refType=null, unstructuredReference=CHEN J M, WANG G J, J C, et al. Open-Set Classification for Signal Diagnosis of Machinery Sensor in Industrial Environment[J]. IEEE Transactions on Industrial Informatics, 2023, 19(3): 2574-2584., articleTitle=Open-Set Classification for Signal Diagnosis of Machinery Sensor in Industrial Environment, refAbstract=null), Reference(id=1194611017364775016, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2022, volume=43, issue=10, pageStart=12, pageEnd=25, url=null, language=null, rfNumber=[7], rfOrder=8, authorNames=王一丰, 郭渊博, 陈庆礼, journalName=通信学报, refType=null, unstructuredReference=王一丰, 郭渊博, 陈庆礼, 等. 基于对比学习的细粒度未知恶意流量分类方法[J]. 通信学报, 2022, 43(10): 12-25., articleTitle=基于对比学习的细粒度未知恶意流量分类方法, refAbstract=null), Reference(id=1194611017427689577, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2022, volume=43, issue=10, pageStart=12, pageEnd=25, url=null, language=null, rfNumber=[7], rfOrder=9, authorNames=WANG Y F, GUO Y B, CHEN Q L, journalName=Journal on Communications, refType=null, unstructuredReference=WANG Y F, GUO Y B, CHEN Q L, et al. A Fine-Grained Approach to Classify Unknown Malicious Traffic Based on Comparative Learning[J]. Journal on Communications, 2022, 43(10): 12-25., articleTitle=A Fine-Grained Approach to Classify Unknown Malicious Traffic Based on Comparative Learning, refAbstract=null), Reference(id=1194611017494798442, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=1597, pageEnd=1607, url=null, language=null, rfNumber=[8], rfOrder=10, authorNames=CHEN T, KORNBLITH S, NOROUZI M, journalName=Proceedings of the 37th International Conference on Machine Learning, refType=null, unstructuredReference=CHEN T, KORNBLITH S, NOROUZI M, et al. A Simple Framework for Contrastive Learning of Visual Representations[C]// Proceedings of the 37th International Conference on Machine Learning. Vienna, Austria: PMLR, 2020: 1597-1607., articleTitle=A Simple Framework for Contrastive Learning of Visual Representations, refAbstract=null), Reference(id=1194611017566101611, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=21271, pageEnd=21284, url=null, language=null, rfNumber=[9], rfOrder=11, authorNames=GRILL J B, STRUB F, ALTCHÉ F, journalName=Conference on Neural Information Processing Systems (Neur IPS 2020), refType=null, unstructuredReference=GRILL J B, STRUB F, ALTCHÉ F, et al. Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning[C]// Conference on Neural Information Processing Systems (Neur IPS 2020). Online: Neur IPS, 2020: 21271-21284., articleTitle=Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning, refAbstract=null), Reference(id=1194611017637404780, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=9912, pageEnd=9924, url=null, language=null, rfNumber=[10], rfOrder=12, authorNames=CARON M, MISRA I, MAIRAL J, journalName=Conference on Neural Information Processing Systems (Neur IPS 2020), refType=null, unstructuredReference=CARON M, MISRA I, MAIRAL J, et al. Unsupervised Learning of Visual Features by Contrasting Cluster Assignments[C]// Conference on Neural Information Processing Systems (Neur IPS 2020). Online: Neur IPS, 2020: 9912-9924., articleTitle=Unsupervised Learning of Visual Features by Contrasting Cluster Assignments, refAbstract=null), Reference(id=1194611017696125037, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[11], rfOrder=13, authorNames=ZHANG H Y, CISSE M, DAUPHIN Y N, journalName=International Conference on Learning Representations, refType=null, unstructuredReference=ZHANG H Y, CISSE M, DAUPHIN Y N, et al. Mixup: Beyond Empirical Risk Minimization[C]// International Conference on Learning Representations. Vancouver, Canada: ICLR, 2018., articleTitle=Mixup: Beyond Empirical Risk Minimization, refAbstract=null), Reference(id=1194611017754845294, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=6022, pageEnd=6031, url=null, language=null, rfNumber=[12], rfOrder=14, authorNames=YUN S D, HAN D Y, CHUN S H, journalName=International Conference on Computer Vision (ICCV 2019), refType=null, unstructuredReference=YUN S D, HAN D Y, CHUN S H, et al. CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features[C]// International Conference on Computer Vision (ICCV 2019). Seoul, Korea: IEEE/CVF, 2019: 6022-6031., articleTitle=CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features, refAbstract=null), Reference(id=1194611017813565551, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=null, pageStart=91, pageEnd=100, url=null, language=null, rfNumber=[13], rfOrder=15, authorNames=KOPUKLU O, ZHENG J P, XU H, journalName=2021 IEEE Winter Conference on Applications of Computer Vision, refType=null, unstructuredReference=KOPUKLU O, ZHENG J P, XU H, et al. Driver Anomaly Detection: A Dataset and Contrastive Learning Approach[C]// 2021 IEEE Winter Conference on Applications of Computer Vision. Waikoloa, HI, USA: IEEE, 2021: 91-100., articleTitle=Driver Anomaly Detection: A Dataset and Contrastive Learning Approach, refAbstract=null), Reference(id=1194611017872285808, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[14], rfOrder=16, authorNames=IANDOLA F N, HAN S, MOSKEWICZ M W, journalName=International Conference on Learning Representations, refType=null, unstructuredReference=IANDOLA F N, HAN S, MOSKEWICZ M W, et al. SqueezeNet: AlexNet-Level Accuracy with 50x Fewer Parameters and <0.5MB Model Size[C]// International Conference on Learning Representations. Toulon, France: ICLR, 2017., articleTitle=SqueezeNet: AlexNet-Level Accuracy with 50x Fewer Parameters and <0.5MB Model Size, refAbstract=null), Reference(id=1194611017935200369, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=6848, pageEnd=6856, url=null, language=null, rfNumber=[15], rfOrder=17, authorNames=ZHANG X Y, ZHOU X Y, LIN M X, journalName=2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018), refType=null, unstructuredReference=ZHANG X Y, ZHOU X Y, LIN M X, et al. ShuffleNet:An Extremely Efficient Convolutional Neural Network for Mobile Devices[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018). Salt Lake City, Utah, USA: IEEE/CVF, 2018: 6848-6856., articleTitle=ShuffleNet:An Extremely Efficient Convolutional Neural Network for Mobile Devices, refAbstract=null), Reference(id=1194611017998114930, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=6105, pageEnd=6114, url=null, language=null, rfNumber=[16], rfOrder=18, authorNames=TAN M X, LE Q V, journalName=International Conference on Machine Learning, refType=null, unstructuredReference=TAN M X, LE Q V. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks[C]// International Conference on Machine Learning. Long Beach, California, USA: PMLR, 2019: 6105-6114., articleTitle=EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks, refAbstract=null), Reference(id=1194611018065223795, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=8697, pageEnd=8710, url=null, language=null, rfNumber=[17], rfOrder=19, authorNames=ZOPH B, VASUDEVAN V, SHLENS J, journalName=2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018), refType=null, unstructuredReference=ZOPH B, VASUDEVAN V, SHLENS J, et al. Learning Transferable Architectures for Scalable Image Recognition[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018). Salt Lake City, Utah, USA: IEEE/CVF, 2018: 8697-8710., articleTitle=Learning Transferable Architectures for Scalable Image Recognition, refAbstract=null), Reference(id=1194611018136526964, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=10734, pageEnd=10742, url=null, language=null, rfNumber=[18], rfOrder=20, authorNames=WU B C, DAI X L, ZHANG P Z, journalName=2019 IEEE/CVP Conference on Computer Vision and Pattern Recognition (CVPR 2019), refType=null, unstructuredReference=WU B C, DAI X L, ZHANG P Z, et al. FBNet:Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search[C]// 2019 IEEE/CVP Conference on Computer Vision and Pattern Recognition (CVPR 2019). Long Beach, California, USA: IEEE/CVP, 2019: 10734-10742., articleTitle=FBNet:Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search, refAbstract=null), Reference(id=1194611018199441525, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=4510, pageEnd=4520, url=null, language=null, rfNumber=[19], rfOrder=21, authorNames=SANDLER M, HOWARD A, ZHU M, journalName=2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018), refType=null, unstructuredReference=SANDLER M, HOWARD A, ZHU M, et al. MobileNetV2: Inverted Residuals and Linear Bottlenecks[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018). Salt Lake City, Utah, USA: IEEE/CVF, 2018: 4510-4520., articleTitle=MobileNetV2: Inverted Residuals and Linear Bottlenecks, refAbstract=null), Reference(id=1194611018262356086, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2023, volume=46, issue=1, pageStart=147, pageEnd=153, url=null, language=null, rfNumber=[20], rfOrder=22, authorNames=张东, 高丙朋, journalName=山西大学学报(自然科学版), refType=null, unstructuredReference=张东, 高丙朋. 基于改进MobileNet网络的人脸识别方法[J]. 山西大学学报(自然科学版), 2023, 46(1): 147-153., articleTitle=基于改进MobileNet网络的人脸识别方法, refAbstract=null), Reference(id=1194611018321076343, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, doi=null, pmid=null, pmcid=null, year=2023, volume=46, issue=1, pageStart=147, pageEnd=153, url=null, language=null, rfNumber=[20], rfOrder=23, authorNames=ZHANG D, GAO B P, journalName=Journal of Shanxi University (Natural Science Edition), refType=null, unstructuredReference=ZHANG D, GAO B P. Facial Recognition Method Based on Improved MobileNet Network[J]. Journal of Shanxi University (Natural Science Edition), 2023, 46(1): 147-153., articleTitle=Facial Recognition Method Based on Improved MobileNet Network, refAbstract=null)], funds=[Fund(id=1194611016710463583, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, awardId=2023MK008, language=CN, fundingSource=*吉林省市场监督管理厅科技计划项目(2023MK008), fundOrder=null, country=null)], companyList=[AuthorCompany(id=1194611012021231641, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, xref=null, ext=[AuthorCompanyExt(id=1194611012029620250, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103), AuthorCompanyExt(id=1194611012033814555, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, companyId=1194611012021231641, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=吉林省产品质量监督检验院, 长春 130013)])], figs=[ArticleFig(id=1194611013988360259, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=nBdXKPNner3m3YNH1SKqRQ==, figureFileBig=4rQcOi2iwOWG1vVBUFKIEw==, tableContent=null), ArticleFig(id=1194611014051274820, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=图1, caption=闭集识别与开集识别, figureFileSmall=nBdXKPNner3m3YNH1SKqRQ==, figureFileBig=4rQcOi2iwOWG1vVBUFKIEw==, tableContent=null), ArticleFig(id=1194611014126772293, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=QC+5AnQAf/T80SnD6djAWg==, figureFileBig=hKOJIu1EBYPMNNfKSk5Zyg==, tableContent=null), ArticleFig(id=1194611014189686854, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=图2, caption=基于对比学习算法识别正常驾驶、异常驾驶行为, figureFileSmall=QC+5AnQAf/T80SnD6djAWg==, figureFileBig=hKOJIu1EBYPMNNfKSk5Zyg==, tableContent=null), ArticleFig(id=1194611014256795719, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=5iGbkUn3K4xn1mrBP0aalQ==, figureFileBig=cfNqsYF/oeWMZTGW19f/dQ==, tableContent=null), ArticleFig(id=1194611014319710280, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=图3, caption=驾驶员异常驾驶行为检测的对比学习框架, figureFileSmall=5iGbkUn3K4xn1mrBP0aalQ==, figureFileBig=cfNqsYF/oeWMZTGW19f/dQ==, tableContent=null), ArticleFig(id=1194611014869164105, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=KcG6NW+yWA4pZliSECoKKg==, figureFileBig=oUT6zETd1Xtpo/2ucbrSFw==, tableContent=null), ArticleFig(id=1194611014948855882, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=图4, caption=2D卷积与3D卷积的区别, figureFileSmall=KcG6NW+yWA4pZliSECoKKg==, figureFileBig=oUT6zETd1Xtpo/2ucbrSFw==, tableContent=null), ArticleFig(id=1194611015020159051, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=doiTycJwkRGH4mF+yHKunA==, figureFileBig=TADXZpkrHknt/LvFr2jI7Q==, tableContent=null), ArticleFig(id=1194611015099850828, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=图5, caption=3D-MobileNetV2模型结构, figureFileSmall=doiTycJwkRGH4mF+yHKunA==, figureFileBig=TADXZpkrHknt/LvFr2jI7Q==, tableContent=null), ArticleFig(id=1194611015162765389, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=T1JVH7TpwWXJ3ynqQCeGVQ==, figureFileBig=c7YYJz5BO9Px3gPK0U4FbA==, tableContent=null), ArticleFig(id=1194611015263428686, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=图6, caption=不同步长的MobileNetV2模块结构, figureFileSmall=T1JVH7TpwWXJ3ynqQCeGVQ==, figureFileBig=c7YYJz5BO9Px3gPK0U4FbA==, tableContent=null), ArticleFig(id=1194611015330537551, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=5hHDftZjKUOvncutitu7LQ==, figureFileBig=AGMerkAL0zQfmbl6Gf2Mkw==, tableContent=null), ArticleFig(id=1194611015393452112, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=图7, caption=正常驾驶的可视化结果, figureFileSmall=5hHDftZjKUOvncutitu7LQ==, figureFileBig=AGMerkAL0zQfmbl6Gf2Mkw==, tableContent=null), ArticleFig(id=1194611015485726801, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=yFKtiiPwsWrefOKvU387PA==, figureFileBig=OJnUrOZOtZitRiJq/wMDUA==, tableContent=null), ArticleFig(id=1194611015561224274, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=图8, caption=异常驾驶的可视化结果, figureFileSmall=yFKtiiPwsWrefOKvU387PA==, figureFileBig=OJnUrOZOtZitRiJq/wMDUA==, tableContent=null), ArticleFig(id=1194611015636721747, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=XMsrgJrvPuKTsOLcp0nYtA==, figureFileBig=917qmsoLA3DOjlLy6jeqtA==, tableContent=null), ArticleFig(id=1194611015716413524, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=图9, caption=新增异常驾驶行为的可视化结果, figureFileSmall=XMsrgJrvPuKTsOLcp0nYtA==, figureFileBig=917qmsoLA3DOjlLy6jeqtA==, tableContent=null), ArticleFig(id=1194611015783522389, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
网络层/步长 重复次数/次 输出大小
输入层 1 3×16×112×112
卷积层1(3×3×3)/s(1,2,2) 1 32×16×56×56
模块1/s(1,1,1) 1 16×16×56×56
模块2/s(2,2,2) 2 24×8×28×28
模块3/s(2,2,2) 3 32×4×14×14
模块4/s(2,2,2) 4 64×2×7×7
模块5/s(1,1,1) 3 96×2×7×7
模块6/s(2,2,2) 3 160×1×4×4
模块7/s(1,1,1) 1 320×1×4×4
卷积层2(1×1×1)/s(1,1,1) 1 1 280×1×4×4
平均池化层/s(1,1,1) 1 1 024×1×1×1
全连接层 1 类别数量
), ArticleFig(id=1194611015871602774, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=表1, caption=

3D-MobileNetV2体系架构

, figureFileSmall=null, figureFileBig=null, tableContent=
网络层/步长 重复次数/次 输出大小
输入层 1 3×16×112×112
卷积层1(3×3×3)/s(1,2,2) 1 32×16×56×56
模块1/s(1,1,1) 1 16×16×56×56
模块2/s(2,2,2) 2 24×8×28×28
模块3/s(2,2,2) 3 32×4×14×14
模块4/s(2,2,2) 4 64×2×7×7
模块5/s(1,1,1) 3 96×2×7×7
模块6/s(2,2,2) 3 160×1×4×4
模块7/s(1,1,1) 1 320×1×4×4
卷积层2(1×1×1)/s(1,1,1) 1 1 280×1×4×4
平均池化层/s(1,1,1) 1 1 024×1×1×1
全连接层 1 类别数量
), ArticleFig(id=1194611015955488855, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
卷积层卷积核(输入) 后续网络(输出)
1×1×C BatchNorm+Mish
3×3×C BatchNorm+Mish
1×1×C BatchNorm
), ArticleFig(id=1194611016030986328, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=表2, caption=

3D-MobileNetV2的Bottleneck结构

, figureFileSmall=null, figureFileBig=null, tableContent=
卷积层卷积核(输入) 后续网络(输出)
1×1×C BatchNorm+Mish
3×3×C BatchNorm+Mish
1×1×C BatchNorm
), ArticleFig(id=1194611016098095193, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
模型 参数量/×106 准确率/% AUC
ShuffleNetV2 4.4 84.84 0.864
ResNet-18 11.2 89.46 0.883
MobileNetV2 3.4 87.62 0.904
3D-MobileNetV2 5.5 93.16 0.960
本文网络 5.5 94.18 0.962
), ArticleFig(id=1194611016186175578, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=表3, caption=

不同CNN的参数量、准确率、AUC比较

, figureFileSmall=null, figureFileBig=null, tableContent=
模型 参数量/×106 准确率/% AUC
ShuffleNetV2 4.4 84.84 0.864
ResNet-18 11.2 89.46 0.883
MobileNetV2 3.4 87.62 0.904
3D-MobileNetV2 5.5 93.16 0.960
本文网络 5.5 94.18 0.962
), ArticleFig(id=1194611016270061659, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
视角 准确率/% AUC
前置视角 86.75 0.886
顶部视角 88.23 0.914
前置+顶部 94.18 0.962
), ArticleFig(id=1194611016358142044, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=表4, caption=

不同视角的试验结果

, figureFileSmall=null, figureFileBig=null, tableContent=
视角 准确率/% AUC
前置视角 86.75 0.886
顶部视角 88.23 0.914
前置+顶部 94.18 0.962
), ArticleFig(id=1194611016446222429, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
损失函数 准确率/% AUC
交叉熵损失 85.74 0.876
加权交叉熵损失 89.12 0.907
对比损失 94.18 0.962
), ArticleFig(id=1194611016521719902, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1190284124595261917, language=CN, label=表5, caption=

不同损失函数的试验结果

, figureFileSmall=null, figureFileBig=null, tableContent=
损失函数 准确率/% AUC
交叉熵损失 85.74 0.876
加权交叉熵损失 89.12 0.907
对比损失 94.18 0.962
)], attaches=null, journal=Journal(id=1189918244568731652, delFlag=0, nameCn=汽车工程师, nameEn=Automotive Engineer, nameHistory1=null, nameHistory2=null, issn=1674-6546, eissn=null, cn=22-1432/U, coden=null, periodic=0, language=CN, oaType=null, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=+bJsKkKt/pjz9u6EwhnksQ==, journalPrice=null, startedYear=null, abbrevIsoEn=null, journalRemark=null, publicationField=null, createdTime=1761628217121, updatedTime=1761735708780, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=A, firstLetterEn=A, subjectCode=Engineering, subjectName=Engineering, subjectCodeEn=Engineering, subjectNameEn=null, picCn=+bJsKkKt/pjz9u6EwhnksQ==, picEn=O3Sn3tnYYrh/jm6emnnMWA==, jcr=null, cjcr=null, exts=[JournalExt(id=1190369097415233706, language=CN, name=汽车工程师, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1761735708812, updatedTime=1761735708812, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=1, submissionEditorUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=3, submissionReviewUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=2, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1190369097553645739, language=EN, name=Automotive Engineer, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1761735708845, updatedTime=1761735708845, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=1, submissionEditorUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=3, submissionReviewUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=2, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1189918454225211397, websiteList=[Website(id=1189918982430847716, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1189918454225211397, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/qcgcs/CN, language=CN, createTime=1761628393037, createBy=18614031015, updateTime=1761628422913, updateBy=18614031015, name=汽车工程师-中文, tplId=1146099689490845704, title=汽车工程师, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1189919800185917791, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=articleTextType, value=kx, createTime=1761628588005, updateTime=1761628588005, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800164946268, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=banner, value=null, createTime=1761628588000, updateTime=1761628588000, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800211083618, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=grayFlag, value=0, createTime=1761628588011, updateTime=1761628588011, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800156557659, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=logo, value=https://castjournals.cast.org.cn/joweb/qcgcs/CN/file/pic?fileId=yiZ96RYoYcnGnRMuWdmkWA==, createTime=1761628587998, updateTime=1761628587998, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800223666532, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=minRunFlag, value=0, createTime=1761628588014, updateTime=1761628588014, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800181723486, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/qcgcs/CN/file/pic, createTime=1761628588004, updateTime=1761628588004, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800215277923, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=silenceFlag, value=0, createTime=1761628588012, updateTime=1761628588012, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800173334877, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1761628588002, updateTime=1761628588002, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800194306400, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=themeColor, value=null, createTime=1761628588007, updateTime=1761628588007, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800202695009, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=themeStyle, value=null, createTime=1761628588009, updateTime=1761628588009, creator=18614031015, updator=18614031015)]), Website(id=1189918982527316711, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1189918454225211397, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/qcgcs/EN, language=EN, createTime=1761628393061, createBy=18614031015, updateTime=1761628543075, updateBy=18614031015, name=汽车工程师-英文, tplId=1146101810881728533, title=Automotive Engineer, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1189919837561352952, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=articleTextType, value=kx, createTime=1761628596916, updateTime=1761628596916, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837540381429, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=banner, value=null, createTime=1761628596911, updateTime=1761628596911, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837582324475, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=grayFlag, value=0, createTime=1761628596921, updateTime=1761628596921, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837527798516, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=logo, value=https://castjournals.cast.org.cn/joweb/qcgcs/EN/file/pic?fileId=yiZ96RYoYcnGnRMuWdmkWA==, createTime=1761628596908, updateTime=1761628596908, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837594907389, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=minRunFlag, value=0, createTime=1761628596924, updateTime=1761628596924, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837557158647, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/qcgcs/EN/file/pic, createTime=1761628596915, updateTime=1761628596915, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837586518780, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=silenceFlag, value=0, createTime=1761628596922, updateTime=1761628596922, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837548770038, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1761628596913, updateTime=1761628596913, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837569741561, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=themeColor, value=null, createTime=1761628596918, updateTime=1761628596918, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837573935866, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=themeStyle, value=null, createTime=1761628596919, updateTime=1761628596919, creator=18614031015, updator=18614031015)])], journalTitle=汽车工程师, weixinUrl=null, journalUrl=https://tjqc.cbpt.cnki.net/, iacademicId=null, status=1, seqNo=null, journalTitleEn=Automotive Engineer, journalPhotoCn=+bJsKkKt/pjz9u6EwhnksQ==, journalPhotoEn=O3Sn3tnYYrh/jm6emnnMWA==, journalFirstLetter=A, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/qcgcs/CN/10.20104/j.cnki.1674-6546.20240304, detailUrlEn=https://castjournals.cast.org.cn/joweb/qcgcs/EN/10.20104/j.cnki.1674-6546.20240304, pdfUrlCn=https://castjournals.cast.org.cn/joweb/qcgcs/CN/PDF/10.20104/j.cnki.1674-6546.20240304, pdfUrlEn=https://castjournals.cast.org.cn/joweb/qcgcs/EN/PDF/10.20104/j.cnki.1674-6546.20240304, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
基于对比学习的驾驶员异常驾驶行为检测算法*
收藏切换
PDF下载
李仲伦 , 于光达 , 杨帅 , 邹世野 , 张鹤群 , 汪春雨
汽车工程师 | 智能车辆环境感知与智能决策技术专刊 2025,(8): 29-36
收起
收藏切换
汽车工程师 | 智能车辆环境感知与智能决策技术专刊 2025, (8): 29-36
基于对比学习的驾驶员异常驾驶行为检测算法*
全屏
李仲伦, 于光达, 杨帅, 邹世野, 张鹤群, 汪春雨
作者信息
  • 吉林省产品质量监督检验院, 长春 130013
Driver Abnormal Driving Behavior Detection Algorithm Based on Contrast Learning
Zhonglun Li, Guangda Yu, Shuai Yang, Shiye Zou, Hequn Zhang, Chunyu Wang
Affiliations
  • Jilin Province Product Quality Supervision and Inspection Institute, Changchun 130103
出版时间: 2025-08-15 doi: 10.20104/j.cnki.1674-6546.20240304
文章导航
收藏切换

针对车辆行驶过程中复杂多变的车内环境和光照条件及驾驶员行为姿态多样性对驾驶员异常行为检测与识别的影响,提出一种基于对比学习的驾驶员异常驾驶行为检测算法。首先,将驾驶员驾驶行为检测视为二分类任务,使用对比学习的方法,将驾驶员正常驾驶与异常驾驶样本进行对比,并通过对比损失函数提高模型的性能。其次,以驾驶员正前方和上方的深度图像作为输入,通过提供驾驶员的深度信息解决车内环境复杂、光照强度变化及视角盲区问题。最后,在轻量化网络MobileNetV2基础上引入3D卷积,并在每个瓶颈结构的卷积层中加入通道混洗操作,提高识别的准确性。试验结果表明,提出的算法在驾驶员异常检测(DAD)数据集测试集中的准确率达到94.18%,受试者操作特征(ROC)曲线下面积(AUC)达到0.962,该算法在驾驶员异常行为检测方面具有有效性。

异常驾驶行为检测  /  对比学习  /  二分类  /  3D卷积神经网络

In the process of driving a vehicle, the complex and changing environment inside the vehicle, the change of lighting conditions and the diversity of drivers’ behavioral postures affect the detection and recognition of abnormal driver behavior. To address this issue, this paper proposes a driver abnormal driving behavior detection algorithm based on contrast learning. The paper firstly considers driver’s driving behavior detection as a binary classification task, and utilizes a contrast learning approach to compare driver’s normal driving with abnormal driving samples and to improve the performance of the model by contrasting loss functions. Secondly, the depth images right ahead and above the driver serves as inputs to solve the problems of complex in-vehicle environment to change the light intensity and blind spots in viewpoint by providing the depth information of the driver. Finally, 3D convolution is introduced in the lightweight network MobileNetV2, and the operation of channel blending is added to the convolution layer of each bottleneck structure to improve the accuracy of recognition. Test results show that accuracy of the proposed algorithm reaches 94.18% in the Driver’s Abnormality Detection (DAD) dataset and ROC AUC reaches 0.962, which shows the effectiveness of the algorithm in driver’s abnormal behavior detection.

Abnormal driving behavior detection  /  Contrast learning  /  Second classification  /  3D Convolution Neural Networks (CNN)
李仲伦, 于光达, 杨帅, 邹世野, 张鹤群, 汪春雨. 基于对比学习的驾驶员异常驾驶行为检测算法*. 汽车工程师, 2025 , (8) : 29 -36 . DOI: 10.20104/j.cnki.1674-6546.20240304
Zhonglun Li, Guangda Yu, Shuai Yang, Shiye Zou, Hequn Zhang, Chunyu Wang. Driver Abnormal Driving Behavior Detection Algorithm Based on Contrast Learning[J]. Automotive Engineer, 2025 , (8) : 29 -36 . DOI: 10.20104/j.cnki.1674-6546.20240304
根据美国国家公路交通安全管理局(National Highway Traffic Safety Administration,NHTSA)的数据[1],80%的交通事故和16%的公路死亡是由驾驶员分心驾驶引起的。驾驶员异常驾驶行为(如疲劳、注意力分散等)易导致道路事故的发生,造成生命及财产损失[2]。因此,对驾驶员的驾驶行为进行实时监测及提醒,可有效减少交通事故的发生。
驾驶员异常驾驶行为检测的传统方法主要有3种:基于生理信号的检测算法,驾驶员需佩戴生理信号采集装置以便将生理信号与疲劳状态相关联,该方法会令驾驶员产生不适;基于图像处理的检测算法,通过对图像特征的提取和分析实现检测,该方法会受到图像质量、角度、遮挡等因素影响,导致检测精度降低;基于驾驶操作行为的检测算法,该方法可能受到驾驶员个人习惯和行为的影响,无法检测某些类型的异常驾驶行为。
近年来,基于深度学习的行为检测方法被提出。陈军等[3]设计了一种级联卷积神经网络检测结构,并采用超分辨率测试序列模型对驾驶员分心驾驶行为进行检测。Tawari[4]等提出了一种分布式相机框架,用于收集头部姿势的动态信息以估计驾驶员的视线方向。上述检测方法具有实时性强、准确率高、无需接触身体等优点,但仍有许多问题需要解决,如易受光照强度变化的影响、对新情况的适应能力差等。
本文针对现有研究存在的问题,通过对比学习的方法区分驾驶员正常驾驶和异常驾驶行为,实现驾驶员异常驾驶行为检测。
深度神经网络在模型训练过程中需大量标记样本作为支撑,但真实驾驶场景复杂多变,获得所有类别的标记样本并不现实[5]。开集识别(Open Set Recognition,OSR)将已知类的数据识别为具体类别,将不属于已知类的数据识别为未知类别。开集识别与闭集识别的对比[6]图1所示,在闭集识别中,算法根据训练集中已有的样本,为每个类划分相应的空间,未知的类别会被错误地分到一个类别;在开集识别中,算法为每个类确定一个与之关联的有限区域,可以产生新的类,并检测出异常。
在传统的分类任务中,模型只需对已知的类别进行分类。在开集识别中,模型还需对未知类别进行分类,这对模型的泛化能力提出了更高的要求。
对比学习是一种基于比较的算法,通过将相似的样本归为同一类别、将不相似的样本归为不同类别进行分类,可应用于开集识别。本文中所有与正常样本不相似的样本均被定义为异常样本。
对比学习属于表征学习,目的是学习一种数据变换方式[7],它通过比较数据的不同视图或变化来学习数据的特征表示。对比学习的基础是包含正样本和负样本的成对样本,其中,正样本属于同一类别或具有相似特征,负样本属于不同类别或具有不同特征。本文的目的是在训练期间学习正常驾驶模板向量vn,测试时,任何超过阈值γ的行为均被认为是异常驾驶行为,如图2所示。
基于对比学习的模型,如SimCLR(Simple framework for Contrastive Learning of visual Representations)[8]、BYOL(Bootstrap Your Own Latent)[9]、SwAV(Swapping Assignments between multiple Views)[10]等,在行为识别领域取得了不错的效果。基于对比学习的数据增强方法,如Mixup[11]、CutMix[12]等,可以提高行为识别模型的性能。本文使用的对比学习框架[13]图3所示。
框架主要由3个部分组成:
a. 编码器。神经网络编码器fθ( )将数据视图映射到特征空间,得到特征向量,并且该特征向量能够很好地表示输入数据的语义信息。在训练过程中,使用对比损失函数训练编码器,使同一输入、不同增强版本的特征向量更加相似,不同输入的特征向量区别更加明显。因此,编码器学习到的特征表示更具判别性,从而提高模型在分类任务中的性能。本文fθ( )采用具有参数θ的3D卷积神经网络(Convolutional Neural Networks,CNN)结构,通过3D CNN用hi=fθ(xi)将输入xi转化为hiR512
b. 投影头。投影头是具有一个由线性整流函数(Rectified Linear Unit,ReLU)激活的隐藏层的多层感知机(Multilayer Perceptron,MLP),投影头gβ( )用于将hi映射到另一个潜在空间vi,并且具有参数β以实现vi=gβ(hi)的变换,其中viR128。本文MLP由2个全连接(Fully Connected,FC)层组成,对输入图像的高维特征进行非线性变换,将其映射到一个低维特征空间,从而提取图像中的有用信息。然后对嵌入维度vi进行归一化,L2范数归一化可使不同的特征维度具有同等重要性,以避免某些特征维度对损失函数的影响过大,使模型更加稳定。
c. 对比损失。对比损失强制要求来自正常驾驶类别的归一化在嵌入彼此之间时比来自异常行为类别的嵌入更加接近。因此,对比损失中的正样本对总是从正常驱动剪辑中选择,而异常驱动剪辑仅用作负样本。具体来说,对比损失函数将两个样本的特征表示作为输入,计算它们的相似性得分,并与预先定义的阈值进行比较。得分高于阈值,表示两个样本在特征空间中相似,低于阈值则表示不相似。
2D卷积虽然能够提取图像的空间特征,但由于缺少时序信息,很难用于视频识别。3D卷积引入了时序的上下文信息,可以更好地提取时空特征,捕捉视频的运动信息和场景变化。2D卷积与3D卷积的区别如图4所示。
在驾驶员行为识别任务中,为将算法更好地适配到资源有限的车载设备,需对网络进行轻量化处理。SqueezeNet[14]使用Fire卷积模块减少参数量,从而实现轻量化。ShuffleNet[15]使用逐点卷积和通道混洗降低计算量、提高准确率。EfficientNet[16]通过同时对网络深度、宽度和图像分辨率进行缩放,实现在降低模型体积和计算成本的同时,提高模型的准确性。此外,还有NASNet[17]、FBNet[18]等轻量化网络。
本文选用轻量化网络MobileNetV2[19],它以MobileNetV1为基础,使用可分离卷积代替传统卷积,分别对空间特征、通道特征进行卷积处理,大幅减少了参数数量和计算量,且在识别检测中应用广泛[20]。本文对其进行3D转换,使其能够利用3D卷积捕捉连续帧之间的运动状态,从而提高状态预测能力。3D-MobileNetV2从卷积层1开始,然后通过7个MobileNetV2模块及卷积层2,最后进入池化层和全连接层。模型结构如图5所示,模块1~模块7的结构如图6所示,模型体系架构如表1所示。其中,FF′为图像的通道数,DWH分别为图像的深度、宽度、高度,Conv表示卷积,DwConv表示深度可分离卷积,BN表示归一化,ReLU6为激活函数,是线性修正单元(Rectified Linear Unit,ReLU)的变体,Add表示将多个特征图或张量相加,Stride表示步幅。
为提高预测的准确性,本文对3D-MobileNetV2进行改进,在原网络基础上,在每个瓶颈(Bottleneck)的中间卷积层加入通道混洗,以减少模型参数和计算量,同时提高模型的精度。通道混洗是一种新型的卷积神经网络结构,通过将输入特征图分组并对每个组内的通道进行混洗,实现通道之间的信息交流。该方法可在不增加计算量的前提下,提高模型的精度和泛化能力。同时,在Bottleneck输出后接入批量规范化(BatchNorm)层和Mish激活函数。BatchNorm通过规范化的手段减少内部协变量偏移,使激活函数的输入值位于激活函数对输入较为敏感的区域,从而使梯度变大,提高神经网络的训练速度和泛化能力,防止过拟合。Mish激活函数没有上限和饱和区域,在训练过程中不会出现梯度消失的问题。3D-MobileNetV2的Bottleneck结构如表2所示。
在训练的迭代中,每个批量包括K个正常视频剪辑和M个异常视频剪辑。编码层和投影层创建的视觉序列为vnivaii∈{1,…,K+M},分别来自正常和异常训练剪辑。每个批量包括K(K-1)个正常样本对和KM个异常样本对,将其输入到对比损失函数中。Lij通过计算由温度参数τ和指数函数缩放的潜在表示之间的点积来计算正常样本对与异常样本对之间的相似性:
${L}_{ij}=-log\frac{K\cdot exp\left(\frac{{v}_{ni}^{T}{v}_{nj}}{\tau }\right)}{K\cdot exp\left(\frac{{v}_{ni}^{T}{v}_{nj}}{\tau }\right)+\frac{1}{M}{\sum }_{m=1}^{M}exp\left(\frac{{v}_{ni}^{T}{v}_{am}}{\tau }\right)}$
每个批量的损失函数为:
$L=\frac{1}{K(K-\left.1\right)}\sum _{i=1}^{K}\sum _{i=1}^{K}{1}_{i\left(j\right)}{L}_{ij}$
其中:
${1}_{i\left(j\right)}=\left\{\begin{array}{l}1,i=j\\ 0,i\ne j\end{array}\right.$
式中:τ∈(0,∞)为温度参数,可以影响模型的输出概率分布,提高泛化能力和鲁棒性。
数据本身、错误标注或噪声导致异常样本与正常样本可能相似,以致负样本对的求和较大,造成优化困难。为解决该问题,对异常样本对的总和进行缩放,使聚合的异常样本对的整体相似度保持较小,并增加正常样本对的权重,以平衡公式中各项的影响力。
在使用对比损失训练后,投影层通常在测试阶段被丢弃[12,17]。在自监督对比学习的背景下,编码阶段学习的潜在表示hi将更加通用,并进一步作为分类任务的预训练模型[12]。对于监督学习任务,投影层(vi)学习到的表示可能更有价值,因为它们利用投影层上学习到的对比损失与监督学习任务保持正常和异常类的距离一致。通过计算式(2),在同一批量中,编码器被更新,以最大化正常驱动特征向量之间的相似性、最小化正常驱动向量及所有其他异常驱动向量之间的相似性。
目前,大多数驾驶员行为检测方法采用可见光摄像头作为图像采集的传感器。可见光传感器易受环境光照强度影响,光照条件较差可能会影响驾驶员行为检测的准确率,造成一定的安全隐患。为解决上述问题,本文在乘员舱的前方和正上方放置2个深度相机,可以获取场景的深度信息,并可在低光照环境下工作。深度图像是将采集器采集到的场景中各点的距离作为像素值的图像,直接反映景物可见表面的几何形状,可以更好地完成驾驶员行为检测任务。前置摄像头记录驾驶员头部、身体的动作,顶部摄像头记录驾驶员双手的动作。2个相机能更好地反映驾驶员的驾驶状态,解决视野盲区的问题。
本文使用驾驶员异常检测(Driver Anomaly Detection,DAD)数据集[17]。DAD数据集的分辨率为224像素×171像素,帧率为45 帧/s。该数据集包含550 min的正常驾驶记录和100 min的异常驾驶记录,其中异常驾驶行为包括头部和身体的运动(伸手到后面、与乘员交谈)及手部交互(手机通话、发消息,喝水)等。
本文试验设置的批量大小是32,最大迭代次数为100。网络使用随机梯度下降(Stochastic Gradient Descent,SGD)优化器进行参数更新。初始学习率设置为0.01,以0.9的速率进行指数衰减。试验使用的图形处理器为NVIDIA GeForce RTX 3060、中央处理器为2.7 GHz Intel Core i7-12700H。
本文采用准确率(Accuracy)、受试者操作特征(Receiver Operating Characteristic,ROC)曲线、ROC曲线下面积(Area Under Curve,AUC)作为模型分类性能的指标。
准确率能够评价模型对全部样本的判断能力:
$A=\frac{{N}_{TP}+{N}_{TN}}{{N}_{TP}+{N}_{FP}+{N}_{TN}+{N}_{FN}}$
式中:NTP、NTN、NFP、NFN分别为真正例、真负例、假正例、假负例的数量。
ROC是一种二分类模型的性能度量,是以假正率(False Positive Rate,FPR)为横坐标、真正率(True Positive Rate,TPR)为纵坐标绘制的曲线。假正率RFP、真正率RTP的计算公式分别为:
${R}_{FP}=\frac{{N}_{FP}}{{N}_{FP}+{N}_{TN}}$
${R}_{TP}=\frac{{N}_{TP}}{{N}_{FP}+{N}_{TN}}$
AUC用于二分类模型的评价,其取值范围为0.5~1.0,越接近1.0,说明检测方法真实性越强,等于0.5时真实性最低、无应用价值。
CNN通常指用于图像分类的二维CNN,3D CNN增加了时间维度,可以从时间和空间维度上提取特征。为验证三维卷积模型的有效性,本文分别进行2D、3D卷积网络试验,试验结果如表3所示。
3D CNN模型用于将每个正常驾驶训练剪辑xi, i∈{1,…,N}编码为一组归一化的512维特征表示。然后,可计算正常驾驶模板向量vn
${v}_{n}=\frac{1}{N}\sum _{i=1}^{N}\frac{{f}_{\theta }\left({x}_{i}\right)}{{‖{f}_{\theta }\left({x}_{i}\right)‖}_{2}}$
为对测试视频片段xi进行分类,再次将其编码为归一化的512维向量,并计算编码片段与vn之间的余弦相似度Ki
${K}_{i}={v}_{n}^{T}\frac{{f}_{\theta }\left({x}_{i}\right)}{{‖{f}_{\theta }\left({x}_{i}\right)‖}_{2}}$
最后,将相似度得分低于阈值γ的片段归类为异常驾驶行为。同时,Ki表征了异常行为的严重程度。
为更加直观地分析本文方法的性能,部分测试可视化结果如图7图8所示,表明本文方法能够准确检测出驾驶员是否具有异常驾驶行为。
为验证本文提出的双视角和损失函数的有效性,保持试验原有的设置条件,对网络进行重新训练。
双视角使用决策级融合的方法将两个视角的信息融合在一起,通过特征提取、特征融合、决策模型、融合策略等多个环节实现信息的融合与决策:
${K}_{\left(add\right)}=\frac{{K}_{\left(top\right)}+{K}_{\left(front\right)}}{2}$
式中:K(add)K(top)K(front)分别为相加的融合相似度、顶部视角融合相似度、前部视角融合相似度。
双视角有效性验证方面,本文对前置视角、顶部视角及前置和顶部双视角3种不同视角的有效性进行说明。3种视角的准确率、AUC对比如表4所示,本文提出的双视角方案的2项指标明显优于另外2种视角方案。
损失函数有效性验证方面,分别使用交叉熵损失(Cross-Entropy Loss)函数、加权交叉熵损失函数及本文提出的对比损失函数对网络进行重新训练,并根据评价指标进行比较。
交叉熵损失通常用于解决分类问题,衡量模型的预测结果与真实标签之间的差距。在二分类的情况下,其计算公式为:
${L}_{CEL}=\frac{1}{N}\sum _{i}-[{y}_{i}\cdot log({p}_{i})+(1-{y}_{i})\cdot log(1-{p}_{i}\left)\right]$
式中:LCEL为交叉熵损失,N为样本总数,yi为样本i的标签(正类为1,负类为0),pi为样本i预测为正类的概率。
加权交叉熵损失函数是对交叉熵损失函数的改进,在计算损失时对不同类别的样本赋予不同的权重。在二分类问题中,其计算公式为:
$L\left(w\right)=-\frac{1}{N}\sum _{i=1}^{N}\left[{w}_{i}{y}_{i}log\right({p}_{i})+(1-{y}_{i}\left)log\right(1-{p}_{i}\left)\right]$
式中:L(w)为加权交叉熵损失函数。
不同损失函数的试验结果如表5所示,本文提出的对比损失函数的2项指标明显优于另外2种函数。
现有的深度学习模型均基于历史数据进行训练,对于新情况的适应能力较差。在实际驾驶过程中,驾驶行为的多样性给识别任务带来了困难。为验证模型的泛化性,选用训练集中未出现的驾驶类别进行验证。本文提出的算法对摘/戴眼镜、调整后视镜、吃东西等新出现的异常行为仍能够准确分类,部分可视化结果如图9所示。
本文针对驾驶员行为检测中光照强度变化、视角盲区以及姿态多样性的问题,提出了一种对比学习的检测方法,通过学习正常驾驶行为的嵌入,检测异常驾驶行为。本文方法在驾驶员异常检测数据集上取得了优异的效果,在后续研究中,可在网络中加入注意力机制,使网络更加关注驾驶员的头部和手部,提升驾驶员异常行为识别的准确率。
  • *吉林省市场监督管理厅科技计划项目(2023MK008)
参考文献 引证文献
排序方式:
[1]
YANG Y B, ZHOU L J, LENG M J, et al. Early Warning of Traffic Accident in Shanghai Based on Large Data Set Mining[C]// 2016 International Conference on Intelligent Transportation, Big Data & Smart City (ICITBS). Changsha, China: IEEE, 2016.
[2]
DINGUS T A, GUO F, LEE S, et al. Driver Crash Risk Factors and Prevalence Evaluation Using Naturalistic Driving Data[J]. Proceedings of the National Academy of Sciences of the United States of America, 2016, 113(10): 2636-2641.
[3]
陈军, 张黎, 周博, 等. 基于级联卷积神经网络的驾驶员分心驾驶行为检测[J]. 科学技术与工程, 2020, 20(14): 5702-5708.
CHEN J, ZHANG L, ZHOU B, et al. Driver Distracted Driving Behavior Detection Based on Cascaded Convolutional Neural Networks[J]. Science Technology and Engineering, 2020, 20(14): 5702-5708.
[4]
TAWARI A, TRIVEDI M M. Robust and Continuous Estimation of Driver Gaze Zone by Dynamic Analysis of Multiple Face Videos[C]// 2014 IEEE Intelligent Vehicles Symposium Proceedings. Dearborn, Michigan, USA: IEEE, 2014: 344-349.
[5]
高菲, 杨柳, 李晖. 开放集识别研究综述[J]. 南京大学学报(自然科学), 2022, 58(1): 115-134.
GAO F, YANG L, LI H. A Review of Open Set Identification Research[J]. Journal of Nanjing University (Natural Science), 2022, 58(1): 115-134.
[6]
CHEN J M, WANG G J, J C, et al. Open-Set Classification for Signal Diagnosis of Machinery Sensor in Industrial Environment[J]. IEEE Transactions on Industrial Informatics, 2023, 19(3): 2574-2584.
[7]
王一丰, 郭渊博, 陈庆礼, 等. 基于对比学习的细粒度未知恶意流量分类方法[J]. 通信学报, 2022, 43(10): 12-25.
WANG Y F, GUO Y B, CHEN Q L, et al. A Fine-Grained Approach to Classify Unknown Malicious Traffic Based on Comparative Learning[J]. Journal on Communications, 2022, 43(10): 12-25.
[8]
CHEN T, KORNBLITH S, NOROUZI M, et al. A Simple Framework for Contrastive Learning of Visual Representations[C]// Proceedings of the 37th International Conference on Machine Learning. Vienna, Austria: PMLR, 2020: 1597-1607.
[9]
GRILL J B, STRUB F, ALTCHÉ F, et al. Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning[C]// Conference on Neural Information Processing Systems (Neur IPS 2020). Online: Neur IPS, 2020: 21271-21284.
[10]
CARON M, MISRA I, MAIRAL J, et al. Unsupervised Learning of Visual Features by Contrasting Cluster Assignments[C]// Conference on Neural Information Processing Systems (Neur IPS 2020). Online: Neur IPS, 2020: 9912-9924.
[11]
ZHANG H Y, CISSE M, DAUPHIN Y N, et al. Mixup: Beyond Empirical Risk Minimization[C]// International Conference on Learning Representations. Vancouver, Canada: ICLR, 2018.
[12]
YUN S D, HAN D Y, CHUN S H, et al. CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features[C]// International Conference on Computer Vision (ICCV 2019). Seoul, Korea: IEEE/CVF, 2019: 6022-6031.
[13]
KOPUKLU O, ZHENG J P, XU H, et al. Driver Anomaly Detection: A Dataset and Contrastive Learning Approach[C]// 2021 IEEE Winter Conference on Applications of Computer Vision. Waikoloa, HI, USA: IEEE, 2021: 91-100.
[14]
IANDOLA F N, HAN S, MOSKEWICZ M W, et al. SqueezeNet: AlexNet-Level Accuracy with 50x Fewer Parameters and <0.5MB Model Size[C]// International Conference on Learning Representations. Toulon, France: ICLR, 2017.
[15]
ZHANG X Y, ZHOU X Y, LIN M X, et al. ShuffleNet:An Extremely Efficient Convolutional Neural Network for Mobile Devices[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018). Salt Lake City, Utah, USA: IEEE/CVF, 2018: 6848-6856.
[16]
TAN M X, LE Q V. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks[C]// International Conference on Machine Learning. Long Beach, California, USA: PMLR, 2019: 6105-6114.
[17]
ZOPH B, VASUDEVAN V, SHLENS J, et al. Learning Transferable Architectures for Scalable Image Recognition[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018). Salt Lake City, Utah, USA: IEEE/CVF, 2018: 8697-8710.
[18]
WU B C, DAI X L, ZHANG P Z, et al. FBNet:Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search[C]// 2019 IEEE/CVP Conference on Computer Vision and Pattern Recognition (CVPR 2019). Long Beach, California, USA: IEEE/CVP, 2019: 10734-10742.
[19]
SANDLER M, HOWARD A, ZHU M, et al. MobileNetV2: Inverted Residuals and Linear Bottlenecks[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018). Salt Lake City, Utah, USA: IEEE/CVF, 2018: 4510-4520.
[20]
张东, 高丙朋. 基于改进MobileNet网络的人脸识别方法[J]. 山西大学学报(自然科学版), 2023, 46(1): 147-153.
ZHANG D, GAO B P. Facial Recognition Method Based on Improved MobileNet Network[J]. Journal of Shanxi University (Natural Science Edition), 2023, 46(1): 147-153.
2025年第卷第8期
PDF下载
453
189
引用本文
BibTeX
文章信息
doi: 10.20104/j.cnki.1674-6546.20240304
  • 首发时间:2025-10-29
  • 出版时间:2025-08-15
补充材料
相关文章
文章信息
作者
出版历史
  • 修回日期:2024-07-05
基金
*吉林省市场监督管理厅科技计划项目(2023MK008)
作者信息
    吉林省产品质量监督检验院, 长春 130013
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/qcgcs/CN/10.20104/j.cnki.1674-6546.20240304
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏