Community for Data Integration (CDI)

Connect and collaborate

The Community for Data Integration (CDI) is a dynamic community of practice working together to grow USGS knowledge and capacity in scientific data and information management and integration.

Collaboration Areas

Join a group of peers to explore common interests and solve shared challenges.

Projects and Products

The CDI supports innovative ideas across the community with seed funding for projects.

Participate

Participate in monthly forums, topical discussions, trainings, or the annual proposals process.

News

Date published: September 26, 2018

FY19 CDI Request for Proposals

Join us this fall for a unique community-driven process for building USGS capabilities in data integration and management.

Date published: April 19, 2018

CDI FY18 Funded Projects Announced

The FY18 CDI Funded Projects are posted on the 2018 Projects page. Congratulations to the project teams!

Publications

Year Published: 2018

Wrangling distributed computing for high-throughput environmental science: An introduction to HTCondor

Biologists and environmental scientists now routinely solve computational problems that were unimaginable a generation ago. Examples include processing geospatial data, analyzing -omics data, and running large-scale simulations. Conventional desktop computing cannot handle these tasks when they are large, and high-performance computing is not...

Erickson, Richard A.; Fienen, Michael N.; McCalla, S. Grace; Weiser, Emily L.; Bower, Melvin L.; Knudson, Jonathan M.; Thain, Greg
Erickson, R.A., Fienen, M.N., McCalla, S.G., Weiser, E.L., Bower, M.L., Knudson, J.M., Thain, G. 2018. Wrangling distributed computing for high-throughput environmental science: An introduction to HTCondor. PLoS Computational Biology. 14(10):e1006468. DOI: 10.1371/journal.pcbi.1006468.

Year Published: 2018

Community for Data Integration 2017 annual report

The Community for Data Integration (CDI) is a group that helps members grow their expertise on all aspects of working with scientific data. The CDI’s activities advance data and information integration capabilities in the U.S. Geological Survey and in the wider Earth and biological sciences. This annual report describes the presentations,...

Hsu, Leslie; Langseth, Madison L.
Hsu, L., and Langseth, M.L., 2018, Community for Data Integration 2017 annual report: U.S. Geological Survey Open-File Report 2018–1110, 19 p., https://doi.org/10.3133/ofr20181110.

Year Published: 2018

U.S. Geological Survey Community for Data Integration 2017 Workshop Proceedings

Executive Summary: The U.S. Geological Survey (USGS) Community for Data Integration (CDI) Workshop was held May 16–19, 2017 at the Denver Federal Center. There were 183 in-person attendees and 35 virtual attendees over four days. The theme of the workshop was “Enabling Integrated Science,” with the purpose of bringing together the community to...

Hsu, Leslie; Hutchison, Vivian B.; Langseth, Madison L.; Wheeler, Benjamin
Hsu, L., Hutchison, V.B., Langseth, M.L., and Wheeler, B., 2018, U.S. Geological Survey Community for Data Integration 2017 Workshop Proceedings: U.S. Geological Survey Open-File Report 2018–1081, 56 p., https://doi.org/10.3133/ofr20181081.