A Micro-Services-Based Approach for Curation and Preservation ...

  • Published on
    14-Feb-2017


Transcript

From Preservation to Curation: extending boundaries, creating new services, engaging new users
Patricia Cruse / Stephen Abrams
University of California Curation Center, California Digital Library
NDIIPP/NDSA Partner Meeting, July 19-21, 2011

Our environment circa 2002-2008
  • Focus on preservation
  • Stakeholders: memory organizations
  • Infrastructure: static
  • Services: hosted
  • Content: museum and library
  • Sustainability: ?

The changing landscape
  • Ever increasing number, size, and diversity of content
  • Ever increasing diversity of partners and stakeholders
  • Decreasing resources
  • Inevitability of disruptive change
      • Technology
      • Institutional mission
      • Users' changing expectations and needs

What keeps users up at night?
  • What is metadata?
  • Are there standards or best practices I should be aware of?
  • How much will it cost?
  • Why should I care about preservation? I just need a place to put my data.
  • Where can I get help?
  • How can I share my work with my colleagues?
  • How can I publish the data associated with my publications?
  • How do I fulfill the data management requirements of my grant?
  • How can I make sure I get credit?
  • Can't my work be included in the Web of Science?
  • How can I provide access to my work?

Four questions or imperatives?
  • How can we best respond organizationally?
  • How does our technical landscape change?
  • What is the value of our services to our diverse community of users?
  • How can we build (or reach) new communities?

University of California Curation Center
Creative partnership between the CDL, the 10 UC campuses, individuals and peer institutions
  • A community of shared concern and practice
  • A channel to pool and distribute diverse experience, expertise, and resources
  • Robust, innovative, and cost-effective solutions to counteract inevitable disruptive change

UC Curation Center's environment today
  • Organization & Stakeholders: UC libraries, UC community, and beyond
  • Focus on curation and entire information lifecycle
  • Technology and Infrastructure: simple, flexible, adaptable
  • Services: diverse
  • Content: agnostic
  • Sustainability: a must
[Diagram labels: UC Community; External to UC. Thanks to MacKenzie Smith, IDCC 2010.]

Data Management Planning (DMP) Tool
Funding agencies requiring a DMP
  1. Connect researchers with resources
  2. Streamline the process to produce a credible and high-quality plan for managing data
Eight institutions coming together
Tool will have multiple phases

DMP Tool Out-of-the-Box
  1. For all users
      • Step-by-step wizard for generating data management plans
      • General guidance for each section: help text and resources relevant to all
      • Save a plan as PDF, MS Word, plain text or generate a link to a PDF version of the finished plan
  2. For DMP Tool Partners
      • Customized links to resources available to all institution's researchers

EZID: long-term identifiers made easy
Take control of the management and distribution of your research, share and get credit for it, and build your reputation through its collection and documentation
Primary Functions
  1. Create persistent identifiers
  2. Manage identifiers over time
  3. Manage associated metadata over time

EZID supports a wish list for data as a key component of scholarly communication
Supporting researchers
  • Precise identification of a dataset
  • Credit to data producers and data publishers
  • A link from the traditional literature to the data
  • Research metrics for datasets
Supporting a community
Business model
  • Tiered pricing structure for UC, non-UC, for-profit
  • Revenue supports operations and development
  • Range of customers: government agencies, research centers, institutions, for-profit
Working with publishers to expose data as publication

Service overview
  • Open to the UC community & beyond
  • Discipline/content agnostic
  • Service delivery: hosted or locally deployed
  • Easy-to-use UI or API
Primary Functions
  1. Deposit
  2. Manage (metadata, versions, etc.)
  3. Share (with other researchers)
  4. Access (expose)
  5. Preserve

Merritt's diverse service offering to the community
Merritt's service offering
  • Dark archive for important digital assets
  • Bright archive with direct discovery and access
  • Preservation back-end for existing or new discovery and content management systems
  • Integration with distributed data grids
  • Local deployments
Supporting the community
Business model
  • Pricing structure for UC, non-UC, and for-profit
  • Pay as you go
  • Pay once, store forever
  • Revenue supports operation and development
  • Range of customers: government agencies, research centers, institutions, for-profit

Web Archiving Service
Capture today's web, build tomorrow's archive
Primary Functions
  1. Collect web-published content
  2. Manage content
  3. Publish content for public access

WAS service overview
Business model in place
  • Pricing structure for UC, non-UC, and for-profit
  • Service fee and storage used
  • Revenue supports operations and development
  • Range of customers: agencies, research centers, academic institutions, researchers, libraries

Digital Curation for EXCEL (DCXL) Project

Open source MS Excel add-ins
Problem statement
  • Data are the building blocks of scientific research.
  • Many scientists use MS Excel to record, manage, view, graph, and manipulate datasets.
  • Excel's current feature set can be a barrier to sharing, verifying, and preserving
DCXL Outputs
  • Requirements (open)
  • Open source add-in: interoperable, sharable, publishable, archivable
  • New community of practice

What an Excel add-in could do
Some preliminary ideas to better publish, share, and archive
  • Permit standardized column headers
  • Versioning and standard date formats
  • Auto-archiving and persistent id assignment
  • Speed bumps to discourage macros et al.
Participants
  • UC3 at the CDL
  • UC Campus community
  • DataONE
  • The broader community
  • MS Research
  • Gordon and Betty Moore Foundation
Delivery date: Spring 2012

Vision for a data paper
  • Idea: wrap the unfamiliar in a familiar façade
  • A data paper minimally consists of a cover sheet and a set of links to archived artifacts
  • Cover sheet contains familiar elements: title, date, authors, abstract, and persistent identifier (DOI, ARK, etc.)
  • Just enough to permit basic exposure to and discovery
      • Building a basic data citation
      • Indexing by services such as Web of Science, Google Scholar
      • Instilling confidence in the identifier's stability

Data Publishing at the CDL
UC Curation Center
  • Merritt: Curation repository
  • EZID: Persistent id management and resolution (ARKs, DOIs, et al.)
Publishing Services Program
  • Online journals, with peer review
  • Scholarly communication: grey literature to postprints
  • Search and display tools (XTF)

Lessons learned (and still learning)
Goal is to work on several fronts to make a complex problem smaller
  • Don't circle the wagons
  • Stop doing what you can't support
  • Outsource and/or use third-party components
  • Deploy new infrastructure and services that can be used in diverse ways
  • Engage with new communities: research community
  • Support emerging initiatives
  • Collaborate now more than ever!

UC3's special agents at the CDL
Tracy Seneca, Margaret Low, Mark Reyes, Stephen Abrams, Perry Willett, Marisa Strong, Greg Janee, David Loy, Scott Fisher, Carly Strasser, Trisha Cruse, John Kunze, Erik Hetzner, Lisa Colvin