{"refrec":{"BRefID":324519,"RR":"<b>Sagi, T.; Lehahn, Y.; Bar, K.</b> (2020). Artificial intelligence for ocean science data integration: current state, gaps, and way forward. <i>Elem. Sci. Anth.  8(1)</i>: 21. <a href=\"https://dx.doi.org/10.1525/elementa.418\" target=\"_blank\">https://dx.doi.org/10.1525/elementa.418</a>","BEntID":317995,"PublicFlag":1,"CheckedFlag":0,"wosflag":1,"vabbflag":0,"RefStringPartII":". <i>Elem. Sci. Anth.  8(1)</i>: 21. <a href=\"https://dx.doi.org/10.1525/elementa.418\" target=\"_blank\">https://dx.doi.org/10.1525/elementa.418</a>","DocTypID":8,"DocType":"Journal article","MarineFlag":0,"FreshFlag":0,"BrackishFlag":0,"TerrestrialFlag":0,"Authorstring":"Sagi, T.; Lehahn, Y.; Bar, K.","OrigTitleTranslFlag":0,"Authorstringtrunc":"Sagi, T.; Lehahn, Y.; Bar, K.","Englishabstract":"Oceanographic research is a multidisciplinary endeavor that involves the acquisition of an increasing amount of in-situ and remotely sensed data. A large and growing number of studies and data repositories are now available on-line. However, manually integrating different datasets is a tedious and grueling process leading to a rising need for automated integration tools. A key challenge in oceanographic data integration is to map between data sources that have no common schema and that were collected, processed, and analyzed using different methodologies. Concurrently, artificial agents are becoming increasingly adept at extracting knowledge from text and using domain ontologies to integrate and align data. Here, we deconstruct the process of ocean science data integration, providing a detailed description of its three phases: discover, merge, and evaluate/correct. In addition, we identify the key missing tools and underutilized information sources currently limiting the automation of the integration process. The efforts to address these limitations should focus on (i) development of artificial intelligence-based tools for assisting ocean scientists in aligning their schema with existing ontologies when organizing their measurements in datasets; (ii) extension and refinement of conceptual coverage of – and conceptual alignment between – existing ontologies, to better fit the diverse and multidisciplinary nature of ocean science; (iii) creation of ocean-science-specific entity resolution benchmarks to accelerate the development of tools utilizing ocean science terminology and nomenclature; (iv) creation of ocean-science-specific schema matching and mapping benchmarks to accelerate the development of matching and mapping tools utilizing semantics encoded in existing vocabularies and ontologies; (v) annotation of datasets, and development of tools and benchmarks for the extraction and categorization of data quality and preprocessing descriptions from scientific text; and (vi) creation of large-scale word embeddings trained upon ocean science literature to accelerate the development of information extraction and matching tools based on artificial intelligence.","AbstractOtherLang":null,"BibLvlCode":"AS","StandardTitle":"Artificial intelligence for ocean science data integration: current state, gaps, and way forward","OrigTitleLangCode":"en","OrigTitleLangCodeExtended":"eng","OrigTitleLangID":15,"DateLastModified":{"date":"2026-05-07 01:32:56.281579","timezone_type":1,"timezone":"+02:00"},"UserAccessRight":null,"UserAccID":null,"AuthorKeywords":"Ontologies","OtherDescriptors":null,"Notes":null,"AnaPub":2020,"MonPub":null,"DateUpdate":"2020-06-08","DateCreate":"2020-05-26","SecASFANote":null,"ConfID":null,"PeerRev":1,"VlizCoreFlag":1,"WoScode":"WOS:000534604000001","VABBcode":null,"OpenAcc":1,"DOI":"10.1525/elementa.418"},"refs":null,"anarec":{"AnaID":324519,"PubliDate":2020,"Pagination":"21","XtraPublOfAnaID":null,"ISBN":null,"Volume":"8","Issue":"1","BRefMon":null,"BRefMonRR":null,"BRefXtra":null,"BRefXtraRR":null,"SerBRefID":267392,"SerRR":"Elementa Science of the Anthropocene. BioOne: Washington.  ISSN 2325-1026; e-ISSN 2325-1026","StandardTitleSer":"Elementa Science of the Anthropocene","ISSN":"2325-1026","AbbrevSer":"Elem. Sci. Anth. ","StandardTitleMon":null,"StartPage":21,"Pages":null,"ToPubliDate":null,"BRefBibLvlCode":"S","SerNotes":null},"monrec":null,"serrec":null,"relations":null,"relationsRev":null,"addrec":null,"othpubs":null,"ownerships":null,"authors":[{"AutName":"Sagi","Firstname":"Tomer","Initials":"T.","Affiliation":null,"Discriminator":null,"CorporateFlag":0,"BEntID":317995,"AutID":416994,"OrderNr":1,"DegrID":null,"EditorFlag":0,"CorrespFlag":0,"IllustratorFlag":0,"ReviserFlag":0,"TranslatorFlag":0,"InsAcronym":null,"InsFSN":null,"ORCID":null,"PersID":null,"InsID":null},{"AutName":"Lehahn","Firstname":"Yoav","Initials":"Y.","Affiliation":"Weizmann Inst Sci, Dept Earth & Planetary Sci, IL-76100 Rehovot, Israel.","Discriminator":null,"CorporateFlag":0,"BEntID":317995,"AutID":301791,"OrderNr":2,"DegrID":null,"EditorFlag":0,"CorrespFlag":0,"IllustratorFlag":0,"ReviserFlag":0,"TranslatorFlag":0,"InsAcronym":null,"InsFSN":null,"ORCID":null,"PersID":null,"InsID":null},{"AutName":"Bar","Firstname":"Koby","Initials":"K.","Affiliation":null,"Discriminator":null,"CorporateFlag":0,"BEntID":317995,"AutID":416996,"OrderNr":3,"DegrID":null,"EditorFlag":0,"CorrespFlag":0,"IllustratorFlag":0,"ReviserFlag":0,"TranslatorFlag":0,"InsAcronym":null,"InsFSN":null,"ORCID":null,"PersID":null,"InsID":null}],"mapdetails":null,"datasets":null,"monographs":null,"monparts":null,"serparts":null,"BEntOpen":null,"BEntPrivate":null,"availability":[{"BInstID":345337,"LibID":36,"BRefID":324519,"EmbargoDate":null,"FullEmbargoDate":null,"PhysMedID":16,"hasOCRd":1,"ShelfLocCode":"345337","RFID":null,"PaidValue":null,"Medium":"Server","Description":"VLIZ Open Access","Acronym":"VLIZ","Library":"Vlaams Instituut voor de Zee","DutchTerm":"Open access","URL":null,"ClassifID":53,"Classification":"Open access","ReqLink":null,"ClassifTypID":1,"URLLocation":"https://www.vliz.be/imisdocs/publications/","SubDir":null,"InternalReq":0,"LoggedInReq":0,"Disclaimer":null,"DutchDisclaimer":null,"FileFormat":".pdf","FileDescr":"pdf","InsPub":1,"InsID":36,"FileFormID":6,"LendableFlag":1,"PublicFlag":1,"orderLib":"A","Notes":null,"AccConID":null,"AccessConstraint":null,"LicURL":null}],"litstyles":null,"thespers":null,"arch2discl":null,"SERpubls":[{"PublName":"BioOne","City":"Washington"}],"MONpubls":null,"pictures":[],"thestermsPath":[{"ThesaurusTerm":"Oceanography","ThestID":5712,"Acronym":"ASFA","ThesTermPath":"Aquatic sciences > Marine sciences > Earth sciences > Oceanography"},{"ThesaurusTerm":"Artificial intelligence","ThestID":565,"Acronym":"ASFA","ThesTermPath":"Artificial intelligence"},{"ThesaurusTerm":"Data integration","ThestID":91437,"Acronym":"CSA","ThesTermPath":"Data integration"}],"thestermsASFA":[{"ThesaurusTerm":"Artificial intelligence"},{"ThesaurusTerm":"Data integration"},{"ThesaurusTerm":"Oceanography"}],"taxtermsASFA":null,"geotermsASFA":null,"collections":[{"Collection":"VLIZ Acknowledged Publications","ShortName":"VLIZ ackn"}],"conf":null,"proj":null,"Physdatasets":null,"spcols":{"941":{"SpName":"LifeWatch Species Information Backbone","SpColID":941,"ParSpColID":39,"TopParID":39,"ShortName":"LifeWatch Species Information Backbone","URLLocation":null,"LibID":36,"OpenRepoFlag":null,"SpTypID":null,"TopParIDNotWebsite":39,"SpColPath":"VLIZ ackn/LifeWatch Species Information Backbone"},"793":{"SpName":"Marine Regions acknowledged","SpColID":793,"ParSpColID":941,"TopParID":39,"ShortName":"Marine Regions ackn","URLLocation":null,"LibID":36,"OpenRepoFlag":null,"SpTypID":null,"TopParIDNotWebsite":39,"SpColPath":"VLIZ ackn/LifeWatch Species Information Backbone/Marine Regions ackn"},"39":{"SpName":"VLIZ Acknowledged Publications","SpColID":39,"ParSpColID":null,"TopParID":null,"ShortName":"VLIZ ackn","URLLocation":null,"LibID":36,"OpenRepoFlag":null,"SpTypID":null,"TopParIDNotWebsite":null,"SpColPath":"VLIZ ackn"},"507":{"SpName":"World Register of Marine Species","SpColID":507,"ParSpColID":null,"TopParID":null,"ShortName":"WoRMS website","URLLocation":null,"LibID":null,"OpenRepoFlag":null,"SpTypID":null,"TopParIDNotWebsite":null,"SpColPath":"WoRMS website"},"915":{"SpName":"World Register of Marine Species (WoRMS) acknowledged","SpColID":915,"ParSpColID":941,"TopParID":39,"ShortName":"WoRMS ackn","URLLocation":null,"LibID":36,"OpenRepoFlag":null,"SpTypID":null,"TopParIDNotWebsite":39,"SpColPath":"VLIZ ackn/LifeWatch Species Information Backbone/WoRMS ackn"},"947":{"SpName":"WoRMS ackn - direct reference","SpColID":947,"ParSpColID":915,"TopParID":39,"ShortName":"WoRMS ackn - direct","URLLocation":null,"LibID":36,"OpenRepoFlag":null,"SpTypID":null,"TopParIDNotWebsite":39,"SpColPath":"VLIZ ackn/LifeWatch Species Information Backbone/WoRMS ackn/WoRMS ackn - direct"}},"doi":null,"publs":null,"serparttypes":null,"monauthors":null,"MParts":null,"SParts":null,"hLibs":null,"langs":[{"BEntID":317995,"AbstractFlag":0,"LangID":15,"LangCode":"en","Lang":"English","DutchTerm":"Engels","LangCodeExtended":"eng"},{"BEntID":317995,"AbstractFlag":1,"LangID":15,"LangCode":"en","Lang":"English","DutchTerm":"Engels","LangCodeExtended":"eng"}],"urls":[{"URL":"https://dx.doi.org/10.1525/elementa.418","externalID":"10.1525/elementa.418","URLTypeCode":"DOI","URLID":83446,"URLTypID":13,"URLType":"DOI","URLPrefix":"http://dx.doi.org/"}],"thesterms":[{"ThesaurusTerm":"Artificial intelligence","ThestID":565,"Acronym":"ASFA","ThesTypID":1,"ThesType":"ASFA Thesaurus List"},{"ThesaurusTerm":"Data integration","ThestID":91437,"Acronym":"CSA","ThesTypID":2,"ThesType":"CSA Technology Research Database Master Thesaurus"},{"ThesaurusTerm":"Oceanography","ThestID":5712,"Acronym":"ASFA","ThesTypID":1,"ThesType":"ASFA Thesaurus List"}],"taxterms":null,"geoterms":null,"othterms":null,"asfacodes":null,"asfa2codes":null,"thestermsFRIS":[{"ThesaurusTerm":"Artificial intelligence","DutchTerm":null,"ThestID":565,"Acronym":"ASFA","ThesTypID":1,"ThesType":"ASFA Thesaurus List"},{"ThesaurusTerm":"Data integration","DutchTerm":null,"ThestID":91437,"Acronym":"CSA","ThesTypID":2,"ThesType":"CSA Technology Research Database Master Thesaurus"},{"ThesaurusTerm":"Oceanography","DutchTerm":"Oceanografie","ThestID":5712,"Acronym":"ASFA","ThesTypID":1,"ThesType":"ASFA Thesaurus List"}],"taxtermsFRIS":null,"geotermsFRIS":null,"othtermsFRIS":null,"resmessage":"","complete":1,"sessions":{"newSesName":"Chisala, Chilekwa, C.","newSesDate":{"date":"2020-05-26 08:34:02.470000","timezone_type":3,"timezone":"Europe/Brussels"},"updSesName":"Chisala, Chilekwa, C.","updSesDate":{"date":"2020-06-08 10:02:29.627000","timezone_type":3,"timezone":"Europe/Brussels"}}}
