Towards Internet-scale Multi-view Stereo|江阴雨辰互联

2024年4月21日发(作者：魔声钻石之泪耳机)

TowardsInternet-scaleMulti-viewStereo

YasutakaFurukawa

GoogleInc.

Abstract

Thispaperintroducesanapproachforenablingexist-

ingmulti-viewstereomethodstooperateonextremelylarge

nideaistodecom-

posethecollectionintoasetofoverlappingsetsofphotos

thatcanbeprocessedinparallel,andtomergetheresult-

erlappingclusteringproblem

isformulatedasaconstrainedoptimizationandsolvedit-

gingalgorithm,designedtobeparallel

andout-of-core,incorporatesrobustﬁlteringstepstoelim-

inatelow-qualityreconstructionsandenforceglobalvisi-

roachhasbeentestedonseveral

,includingone

withovertenthousandimages,yieldinga3Dreconstruc-

tionwithnearlythirtymillionpoints.

BrianCurless

1,2

RichardSzeliski

UniversityofWashington

MicrosoftResearch

sereconstructionofPiazzaSanMarco(Venice)

from13,703imageswith27,707,825reconstructedMVSpoints

(furtherupsampledx9forhighqualitypoint-basedrendering).

uction

Thestateoftheartin3Dreconstructionfromimageshas

dwith

theexplosionofimageryavailableonlineandadvancesin

computing,wehavetheopportunitytorunreconstruction

,wecannowattemptto

,everybuilding,landscape,

and(static)objectthatcanbephotographed.

Themostimportanttechnologicalingredientstowards

SIFT[17])provideaccuratecorrespondences,structure-

from-motion(SFM)algorithmsusethesecorrespondences

toestimateprecisecamerapose,andmulti-view-stereo

(MVS)methodstakeimageswithposeasinputandproduce

dense3Dmodelswithaccuracynearlyonparwithlaser

scanners[22].Indeed,thistypeofpipelinehasalreadybeen

demonstratedbyafewresearchgroups[11,12,14,19],

withimpressiveresults.

Toreconstructeverything,onekeychallengeisscala-

bility.

Inparticular,howcanwedevisereconstructional-

,onthemillions

areotherchallengessuchashandlingcomplexBRDFsand

lightingvariations,whichwedonotaddressinthispaper.

There

GivenrecentprogressonInternet-scalematchingandSFM

(notablyAgarwaletal.’sRome-in-a-dayproject[1]),wefo-

cusoureffortsinthispaperonthelaststageofthepipeline,

i.e.,Internet-scaleMVS.

MVSalgorithmsarebasedontheideaofcorrelating

measurementsfromseveralimagesatoncetoderive3D

Salgorithmsaimatrecon-

structingaglobal3Dmodelbyusingalltheimagesavail-

ablesimultaneously[9,13,20,24].Suchanapproachisnot

d,itbecomes

importanttoselecttherightsubsetofimages,andtocluster

themintomanageablepieces.

Weproposeanovelviewselectionandclusteringscheme

thatallowsawideclassofMVSalgorithmstoscaleupto

edwithanewmergingmethod

thatrobustlyﬁltersoutlow-qualityorerroneouspoints,we

demonstrateourapproachrunningforthousandsofimages

temistheﬁrstto

demonstrateanunstructuredMVSapproachatcity-scale.

Weproposeanoverlappingviewclusteringproblem[2],

inwhichthegoalistodecomposethesetofinputimages

pisimportant

fortheMVSproblem,asastrictpartitionwouldundersam-

ustered,we

applyastate-of-the-artMVSalgorithmtoreconstructdense

3Dpoints,andthenmergetheresultingreconstructionsinto

ﬁlteringalgo-

rithmsareintroducedtohandlereconstructionerrorsand

thevastvariationsinreconstructionqualitythatoccurbe-

tweendistantandnearbyviewsofobjectsinInternetphoto

ﬁltersaredesignedtobeout-of-coreand

parallel,inordertoprocessalargenumberofMVSpoints

efﬁvisualizationsofmodelscontaining

tensofmillionsofpoints(seeFigure1).

dWork

ScalabilityhasrarelybeenaconsiderationinpriorMVS

algorithms,aspriordatasetshavebeeneitherrelatively

small[22],avideosequence

whichcanbedecomposedintoshorttimeintervals[19]).

Nevertheless,somealgorithmslendthemselvesnaturally

icular,severalalgorithmsoperate

bysolvingforadepthmapforeachimage,usingalocal

neighborhoodofnearbyimages,andthenmergetheresult-

ingreconstructions[11,12,18,19].Eachdepthmapcan

r,the

depthmapstendtobenoisyandhighlyredundant,leading

ore,thesealgorithms

typicallyrequireadditionalpost-processingstepstoclean

upandmergethedepthmaps.

ManyofthebestperformingMVSalgorithmsinstead

reconstructaglobal3Dmodeldirectlyfromtheinputim-

ages[9,13,20,24].Globalmethodscanavoidredun-

dantcomputationsandoftendonotrequireaclean-uppost-

process,eptionisJancoseketal.

[14]whoachievescalabilitybydesigningthealgorithmout-

r,rast,

weseekanout-of-corealgorithmthatisalsoparallelizable.

Withdepth-mapbasedMVSalgorithms,severalauthors

havesucceededinlarge-scaleMVSreconstructions[18,

19].Pollefeysetal.[19]presentareal-timeMVSsys-

timateadepthmap

foreachinputimage,reducenoisebyfusingnearbydepth

maps,andmergetheresultingdepthmapsintoasingle

ketal.[18]proposeapiece-wise

planardepthmapcomputationalgorithmwithverysimilar

r,bothmethodshave

beentestedonlyonhighlystructured,street-viewdatasets

obtainedbyavideocameramountedonamovingvan,and

nottheunstructuredphotocollectionsthatweconsiderin

thispaper,whichposeadditionalchallenges.

Besidesscalability,variationinreconstructionqualityis

anotherchallengeinhandlinglargeunorganizedimagecol-

lections,assurfacesmaybeimagedfrombothcloseupand

eetal.[12]proposedtheﬁrstMVSmethod

appliedtoInternetphotocollections,whichhandlesvaria-

tioninimagesamplingresolutionsbyselectingimageswith

etal.[10]select

imagesatdifferentbaselinesandimageresolutionstocon-

SFM point

SFM points

image

images

cluster

, ...}

Images

, ...}

Image clusters

, ...}

wclusteringalgorithmtakesimages{I

visibility

}

information

},SFM

points{P

},andtheirassociated

},then

producesoverlappingimageclusters{C

thesemethodshandlevariation

ech-

niquesmaybeusedinconjunctionwiththemethodspro-

posedhere,butthemajordifferenceinourworkisthatwe

alsohandlethevariationinapost-processingstep,when

thatsomepriordepth

mapmergingalgorithmstakeintoaccountestimatesofun-

,bytakingweightedcombinationsofdepth

samplestorecoveramesh[4,25].Whilesuchapproaches

canhandlenoisevariation,weﬁndtheydonotperformwell

forlargeInternetphotocollections,whereresolutionvaria-

tionisamajorfactor,becausecombininghighandlowreso-

lutiongeometriesinthestandardwayswilltendtoattenuate

eadproposeasimplemerg-

ingstrategythatﬁltersoutlowresolutiongeometry,which

wehavefoundtoberobustandwell-tailoredtorecovering

apoint-basedmodelasoutput.

clusteringalgorithmisexplainedinSection2,anddetails

oftheMVSpointmergingandrenderingaregiveninSec-

mentalresultsareprovidedinSection4and

lementationof

theproposedview-clusteringalgorithmisavailableat[6].

ustering

Weassumethatourinputimages{I

algorithmtoyieldcamera

}havebeenpro-

cessedbyanSFMposesanda

sparsesetof3Dpoints{P

},eachofwhichisvisibleina

setofimagesdenotedbyV

.WetreattheseSFMpointsas

sparsesamplesofthedensereconstructionthatMVSwill

,theycanbeusedasabasisforviewclus-

eciﬁcally,thegoalofviewclusteringisto

ﬁnd(anunknownnumberof)overlappingimageclusters

}suchthateachclusterisofmanageablesize,andeach

SFMpointcanbeaccuratelyreconstructedbyatleastone

oftheclusters(seeFigure2).

mFormulation

Theclusteringformulationisdesignedtosatisfythefol-

lowingthreeconstraints:(1)redundantimagesareexcluded

fromtheclusters(compactness),(2)eachclusterissmall

enoughforanMVSreconstruction(sizeconstraint);and

发布者：admin，转转请注明出处：http://www.yc00.com/num/1713708078a2302656.html

Towards Internet-scale Multi-view Stereo

发表回复

评论列表（0条）

联系我们

400-800-8888

Towards Internet-scale Multi-view Stereo

相关推荐

发表回复

评论列表（0条）

联系我们

400-800-8888