A survey on transfer learning|江阴雨辰互联

2024年4月15日发(作者：24小时精准天气预报)

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

ASurveyonTransferLearning

SinnoJialinPanandQiangYangFellow,IEEE

Abstract—Amajorassumptioninmanymachinelearninganddataminingalgorithmsisthatthetrainingandfuturedatamustbe

r,inmanyreal-worldapplications,thisassumptionmaynothold.

Forexample,wesometimeshaveaclassiﬁcationtaskinonedomainofinterest,butweonlyhavesufﬁcienttrainingdatainanother

domainofinterest,wherethelatterdatacases,

knowledgetransfer,ifdonesuccessfully,wouldgreatlyimprovetheperformanceoflearningbyavoidingmuchexpensivedatalabeling

ntyears,transfrveyfocuseson

categorizingandreviewingthecurrentprogressontransferlearningforclassiﬁcation,survey,

wediscusstherelationshipbetweentransferlearningandotherrelatedmachinelearningtechniquessuchasdomainadaptation,multi-

tasklearningandsampleselectionbias,exploresomepotentialfutureissuesintransferlearning

research.

IndexTerms—TransferLearning,Survey,MachineLearning,DataMining.

✦

NTRODUCTION

Dataminingandmachinelearningtechnologieshavealready

achievedsigniﬁcantsuccessinmanyknowledgeengineering

areasincludingclassiﬁcation,,

[1],[2]).However,manymachinelearningmethodsworkwell

onlyunderacommonassumption:thetrainingandtestdataare

drawnfromthesamefeaturespaceandthesamedistribution.

Whenthedistributionchanges,moststatisticalmodelsneedto

manyrealworldapplications,itisexpensiveorimpossibleto

wouldbenicetoreducetheneedandefforttore-collectthe

cases,knowledgetransferortransfer

learningbetweentaskdomainswouldbedesirable.

Manyexamplesinknowledgeengineeringcanbefound

wheretransferlearningcantrulybebeneﬁmple

isWebdocumentclassiﬁcation[3],[4],[5],whereourgoal

istoclassifyagivenWebdocumentintoseveralpredeﬁned

ampleintheareaofWeb-document

classiﬁcation(,[6]),thelabeledexamplesmaybe

theuniversityWebpagesthatareassociatedwithcategory

informationobtainedthroughpreviousmanual-labelingefforts.

ForaclassiﬁcationtaskonanewlycreatedWebsitewherethe

datafeaturesordatadistributionsmaybedifferent,theremay

ult,wemaynotbe

abletodirectlyapplytheWeb-pageclassiﬁerslearnedonthe

cases,itwould

behelpfulifwecouldtransfertheclassiﬁcationknowledge

intothenewdomain.

Theneedfortransferlearningmayarisewhenthedatacan

case,thelabeleddataobtainedin

onetimeperiodmaynotfollowthesamedistributionina

mple,inindoorWiFilocalization

DepartmentofComputerScienceandEngineering,HongKongUniversityof

ScienceandTechnology,ClearwaterBay,Kowloon,HongKong

Emails:{sinnopan,qyang}@

problems,whichaimstodetectauser’scurrentlocationbased

onpreviouslycollectedWiFidata,itisveryexpensiveto

calibrateWiFidataforbuildinglocalizationmodelsinalarge-

scaleenvironment,becauseauserneedstolabelalarge

r,the

WiFisignal-strengthvaluesmaybeafunctionoftime,device

trainedinonetimeperiod

orononedevicemaycausetheperformanceforlocation

estimationinanothertimeperiodoronanotherdevicetobe

cethere-calibrationeffort,wemightwishto

adaptthelocalizationmodeltrainedinonetimeperiod(the

sourcedomain)foranewtimeperiod(thetargetdomain),or

toadaptthelocalizationmodeltrainedonamobiledevice(the

sourcedomain)foranewmobiledevice(thetargetdomain),

asdonein[7].

Asathirdexample,considertheproblemofsentiment

classiﬁcation,whereourtaskistoautomaticallyclassifythe

reviewsonaproduct,suchasabrandofcamera,intopositive

sclassiﬁcationtask,weneedto

ﬁrstcollectmanyreviewsoftheproductandannotatethem.

Wewouldthentrainaclassiﬁeronthereviewswiththeir

hedistributionofreviewdata

amongdifferenttypesofproductscanbeverydifferent,to

maintaingoodclassiﬁcationperformance,weneedtocollect

alargeamountoflabeleddatainordertotrainthereview-

classiﬁr,thisdata-

cethe

effortforannotatingreviewsforvariousproducts,wemay

wanttoadaptaclassiﬁcationmodelthatistrainedonsome

productstohelplearnclassiﬁcationmodelsforsomeother

cases,transferlearningcansaveasigniﬁcant

amountoflabelingeffort[8].

Inthissurveyarticle,wegiveacomprehensiveoverviewof

transferlearningforclassiﬁcation,regressionandclustering

hasbeenalargeamountofworkontransferlearningfor

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

[9],[10]).However,inthispaper,weonlyfocusontransfer

learningforclassiﬁcation,regressionandclusteringproblems

thesurvey,wehopetoprovideausefulresourceforthedata

miningandmachinelearningcommunity.

ext

foursections,weﬁrstgiveageneraloverviewanddeﬁne

brieﬂysurveythe

historyoftransferlearning,giveauniﬁeddeﬁnitionoftransfer

learningandcategorizetransferlearningintothreedifferent

settings(giveninTable2andFigure2).Foreachsetting,we

reviewdifferentapproaches,

that,inSection6,wereviewsomecurrentresearchonthe

topicof“negativetransfer”,whichhappenswhenknowledge

ion7,

weintroducesomesuccessfulapplicationsoftransferlearning

andlistsomepublisheddatasetsandsoftwaretoolkitsfor

y,weconcludethearticlewith

adiscussionoffutureworksinSection8.

VERVIEW

2.1ABriefHistoryofTransferLearning

Traditionaldataminingandmachinelearningalgorithmsmake

predictionsonthefuturedatausingstatisticalmodelsthatare

trainedonpreviouslycollectedlabeledorunlabeledtraining

data[11],[12],[13].Semi-supervisedclassiﬁcation[14],[15],

[16],[17]addressestheproblemthatthelabeleddatamay

betoofewtobuildagoodclassiﬁer,bymakinguseofa

largeamountofunlabeleddataandasmallamountoflabeled

ionsofsupervisedandsemi-supervisedlearning

forimperfectdatasetshavebeenstudied;forexample,Zhu

andWu[18]havestudiedhowtodealwiththenoisyclass-

eredcost-sensitivelearning

[19]whenadditionaltestscanbemadetofuturesamples.

Nevertheless,mostofthemassumethatthedistributionsof

erlearning,

incontrast,allowsthedomains,tasks,anddistributionsused

ealworld,we

mple,

wemayﬁndthatlearningtorecognizeapplesmighthelpto

rly,learningtoplaytheelectronicorgan

dyofTransfer

learningismotivatedbythefactthatpeoplecanintelligently

applyknowledgelearnedpreviouslytosolvenewproblems

damentalmotivation

forTransferlearningintheﬁeldofmachinelearningwas

discussedinaNIPS-95workshopon“LearningtoLearn”

,whichfocusedontheneedforlifelongmachine-learning

methodsthatretainandreusepreviouslylearnedknowledge.

Researchontransferlearninghasattractedmoreand

moreattentionsince1995indifferentnames:learningto

learn,life-longlearning,knowledgetransfer,inductivetrans-

fer,multi-tasklearning,knowledgeconsolidation,context-

sensitivelearning,knowledge-basedinductivebias,metalearn-

ing,andincremental/cumulativelearning[20].Amongthese,

:///courses/comp/dsilver/NIPS95LTL/

acloselyrelatedlearningtechniquetotransferlearningis

themulti-tasklearningframework[21],whichtriestolearn

multipletaskssimultaneouslyevenwhentheyaredifferent.

Atypicalapproachformulti-tasklearningistouncoverthe

common(latent)featuresthatcanbeneﬁteachindividualtask.

In2005,theBroadAgencyAnnouncement(BAA)05-29

ofDefenseAdvancedResearchProjectsAgency(DARPA)’s

InformationProcessingTechnologyOfﬁce(IPTO)

gavea

newmissionoftransferlearning:theabilityofasystemto

recognizeandapplyknowledgeandskillslearnedinprevious

deﬁnition,transferlearningaims

toextracttheknowledgefromoneormoresourcetasksand

rasttomulti-task

learning,ratherthanlearningallofthesourceandtargettasks

simultaneously,transferlearningcaresmostaboutthetarget

esofthesourceandtargettasksarenolonger

symmetricintransferlearning.

Figure1showsthedifferencebetweenthelearning

wecansee,traditionalmachinelearningtechniquestrytolearn

eachtaskfromscratch,whiletransferlearningtechniquestry

totransfertheknowledgefromsomeprevioustaskstoatarget

taskwhenthelatterhasfewerhigh-qualitytrainingdata.

(a)TraditionalMachineLearning(b)TransferLearning

entLearningProcessesbetweenTraditional

MachineLearningandTransferLearning

Today,transferlearningmethodsappearinseveraltop

venues,mostnotablyindatamining(ACMKDD,IEEEICDM

andPKDD,forexample),machinelearning(ICML,NIPS,

ECML,AAAIandIJCAI,forexample)andapplicationsof

machinelearninganddatamining(ACMSIGIR,WWWand

ACLforexample)

.Beforewegivedifferentcategorizations

oftransferlearning,weﬁrstdescribethenotationsusedinthis

article.

2.2NotationsandDeﬁnitions

Inthissection,weintroducesomenotationsanddeﬁnitions

fall,wegivethedeﬁnitions

ofa“domain”anda“task”,respectively.

Inthissurvey,adomainDconsistsoftwocomponents:a

featurespaceXandamarginalprobabilitydistributionP(X),

whereX={x

,...,x

}∈mple,ifourlearning

:///ipto/programs/tl/

arizealistofconferencesandworkshopswheretransfer

learningpapersappearinthesefewyearsinthefollowingwebpagefor

reference,/∼sinnopan/

发布者：admin，转转请注明出处：http://www.yc00.com/xitong/1713144092a2191556.html

A survey on transfer learning

发表回复

评论列表（0条）

联系我们

400-800-8888

A survey on transfer learning

相关推荐

发表回复

评论列表（0条）

联系我们

400-800-8888