Renaissance Computing Institute

Renaissance Computing Institute (RENCI)
RENCI's main campus at Europa Drive, Chapel Hill, NC
Established: 2004
Field of research: data science and cyberinfrastructure; environmental sciences; biomedical and health sciences
Director: Stanley C. Ahalt, PhD
Location: Chapel Hill, NC
Affiliations: University of North Carolina at Chapel Hill
Website: renci.org

Renaissance Computing Institute (RENCI) was launched in 2004 as a collaboration involving the State of North Carolina, University of North Carolina at Chapel Hill (UNC-CH), Duke University, and North Carolina State University. RENCI is organizationally structured as a research institute within UNC-CH, and its main campus is located in Chapel Hill, NC, a few miles from the UNC-CH campus. RENCI has engagement centers at UNC-CH, Duke University (Durham), and North Carolina State University (Raleigh).

RENCI's founding director was Daniel A. Reed; Stanley C. Ahalt is the current director. RENCI employs over 80 staff members.

Mission statement

RENCI's current mission is: "to develop and deploy advanced technologies to enable research discoveries and practical innovations." [1] RENCI achieves its mission by partnering with academic researchers, governmental policy makers, and industry leaders to engage in research and development aimed at solving critical challenges in several focus areas: data science and cyberinfrastructure; environmental sciences; and biomedical and health sciences.

History

RENCI was founded in January 2004 by Daniel A. Reed, PhD, with funding from the State of North Carolina, UNC-CH, North Carolina State University, and Duke University. [2] [3] Dr. Reed formerly served as director of the National Center for Supercomputing Applications (NCSA), Chief Architect for the National Science Foundation (NSF) TeraGrid initiative, and Member of the President's Information Technology Advisory Committee. In May 2004, Alan Blatecky joined RENCI as deputy director. Mr. Blatecky formerly served as executive director of the San Diego Supercomputer Center and head of the NSF Middleware initiative.

RENCI's initial mission statement was:

to serve as a multidisciplinary institute bridging academe, commerce and society to enrich and empower human potential, create multi-institutional partnerships, and develop and deploy world-leading computational infrastructure.

In December 2005, RENCI received $5.9M in funding from the State of North Carolina for FY2005-2006 and $11.8M in recurring funds for "staff support, computer operations and equipment." This funding was critical as RENCI built a statewide infrastructure to support a virtual organization and then used that infrastructure, together with the expertise of its staff, to pursue federally funded projects of interest to the State. RENCI's initial focus was on applying cyber technologies and advanced analytics to coastal disaster planning, mitigation, and response. RENCI has since engaged in diverse partnerships throughout North Carolina and across the nation. Those partnerships have yielded numerous federal grant awards, providing the organization with an additional revenue stream.

RENCI underwent a change in leadership in 2007, with the departure of Dr. Reed and the appointment of Mr. Blatecky as interim director. RENCI implemented its first-ever strategic planning process during this time. The process led to a revised mission statement:

The Renaissance Computing Institute, a multi-institutional organization, brings together multidisciplinary experts and advanced technological capabilities to address pressing research issues and to find solutions to complex problems that affect the quality of life in North Carolina, our nation and the world.

In 2009, Stanley C. Ahalt, PhD, was appointed to the position of director. Dr. Ahalt previously served as executive director of the Ohio Supercomputer Center (OSC) and was a professor in the Department of Electrical and Computer Engineering at Ohio State University (OSU). Upon arriving at RENCI, Dr. Ahalt received a joint appointment as professor in the department of computer science at UNC-CH.

Ashok Krishnamurthy, PhD, was appointed as deputy director in February 2013. Dr. Krishnamurthy was previously the director of research and scientific development at OSC and an associate professor in the Department of Electrical and Computer Engineering at OSU.

Under the leadership of Drs. Ahalt and Krishnamurthy, RENCI expanded its staff, external partnerships, and breadth of activities. Four key partnerships and initiatives have been launched. The first is a partnership with the School of Medicine at UNC-CH on a National Institutes of Health (NIH) Center for Translational and Clinical Science award, which led to the establishment of the North Carolina Translational and Clinical Sciences Institute (NC TraCS) in 2008. Drs. Ahalt and Krishnamurthy serve as director and co-director, respectively, of the Biomedical Informatics Service within NC TraCS. The second was the founding of the Water Science Software Institute (WSSI), which was co-founded by RENCI and the National Socio-Environmental Synthesis Center (SESYNC) in September 2012. The third was the creation of the National Consortium for Data Science (NCDS) in February 2013. The NCDS is headquartered at RENCI and includes members drawn from academia, industry, and government. The fourth was the establishment of the iRODS Consortium in March 2013. The iRODS Consortium is also headquartered at RENCI and includes a diverse international membership.

Current leadership

Key research and development focus areas and technologies

Data science and cyberinfrastructure

RENCI has a number of active research programs aimed at developing and deploying advanced computing and networking capabilities. Many of the resultant technologies are open source.

For example, the open source ExoGENI (Exo-Global Environment for Network Innovation) is being developed as part of the NSF-funded GENI initiative. [4] [5] ExoGENI functions as a federated, cloud-based Networked Infrastructure-as-a-Service (NIaaS) platform for dynamic provisioning of networking, storage, and compute resources. ADAMANT (Adaptive Data-Aware Multi-domain Application Network Topologies), also funded by the NSF, builds upon ExoGENI. ADAMANT integrates the Pegasus and HTCondor scientific workflow systems with the ExoGENI NIaaS platform to orchestrate the execution of large-scale scientific workflows over distributed cloud or traditional high-performance computing resources.

iRODS (integrated Rule-Oriented Data System) was developed by the Data Intensive Cyber Environments (DICE) Centers at UNC-CH and the University of California, San Diego and is currently maintained by RENCI. iRODS is an open source middleware technology designed to provide policy-based control over data access, movement, use, and archiving across geographical sites, disparate storage technologies, and multiple user groups, each with varying policies regarding data access and use. [6] [7] [8] [9]

RADII (Resource Aware Data-centric collaborative Infrastructure) integrates GENI's ORCA (Open Resource Control Architecture) with iRODS to dynamically provision a distributed cloud-based infrastructure for multi-institutional, data-driven research collaborations. RADII accomplishes this through software designed to model research data and map data elements, computations, and storage onto the underlying physical infrastructure of iRODS.

DataBridge aims to provide a multi-dimensional sociometric network system for sharing long-tail data collections. [10] [11] [12] DataBridge is an open source collaboration tool that allows scientists to explore available data sets and their relevant algorithms and define semantic bridges to link to and access diverse data sets within the sociometric network.
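As a rough illustration of what the policy-based control described for iRODS could look like in the abstract, the Python sketch below models each policy as a record that permits one user group to perform one action at a set of sites. This is not the iRODS rule language or any iRODS API; the names `Policy` and `is_allowed`, and the sample groups, actions, and sites, are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Policy:
    """Hypothetical policy record: one group, one action, permitted sites."""
    group: str
    action: str            # e.g. "read", "move", "archive"
    sites: frozenset       # sites where the action is permitted


def is_allowed(policies, group, action, site):
    """Return True if any policy grants this group the action at this site."""
    return any(
        p.group == group and p.action == action and site in p.sites
        for p in policies
    )


# Illustrative data only: two groups with different rules at different sites.
policies = [
    Policy("genomics", "read", frozenset({"chapel-hill", "durham"})),
    Policy("genomics", "archive", frozenset({"chapel-hill"})),
    Policy("hydrology", "read", frozenset({"raleigh"})),
]
```

With these sample policies, `is_allowed(policies, "genomics", "read", "durham")` is true, while the same group may not archive at that site. The point of the sketch is only that access decisions are driven by declared policies rather than by code at each storage location.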

Environmental sciences

Many of RENCI's projects in the Environmental Sciences focus on hydrology, coastal storm surges, and advanced modeling to assist in disaster preparedness.

ADCIRC is an open source software model that applies advanced analytics to multiple data sources and types (e.g., hydrology data sets, atmospheric data sets, tropical storm forecasting data, and Geographic Information System data) to enable real-time, high-resolution prediction of the impact of coastal storm surges and flooding after hurricanes and related events. [13] [14] In collaboration with researchers at the UNC Coastal Resilience Center and the National Hurricane Center, ADCIRC is being developed as a coastal forecasting system to assist with state and federal disaster planning and decision support.

EarthCube is an NSF-funded initiative that aims "to develop a framework over the next decade to assist researchers in understanding and predicting the Earth system from the Sun to the center of the Earth." [15] [16] EarthCube is being designed as an open dynamic cyberinfrastructure to enable community-governed data sharing across the geosciences, including ocean science, polar studies, atmospheric science, geospace, computer science, and other fields.

HydroShare is supported by the NSF-funded CUAHSI (Consortium of Universities for the Advancement of Hydrologic Science Inc.) and is under development as an open collaboration cyberinfrastructure for hydrology. [17] [18] [19] HydroShare allows water scientists to identify and retrieve water-related data sets and associated algorithms and models and then analyze and compute on the data using a distributed computing environment that includes grid-based cloud and high-performance computing and storage capabilities.

Biomedical and health sciences

A major focus of RENCI's work in the Biomedical and Health Sciences is clinical genomics. RENCI works with NC TraCS, the Lineberger Comprehensive Cancer Center at UNC-CH, and UNC's Information Technology Services Research Computing Division to develop and implement technologies to support next-generation genomic sequencing technologies, such as Whole Genome Sequencing (WGS) and Whole Exome Sequencing (WES).

These technologies include the GMW (Genetic Medical Workflow) Engine, which was funded in part by the NIH and provides end-to-end capture, analysis, validation, and reporting of WGS and WES data. The GMW Engine is designed as an open source architecture that coordinates workflows, sub-workflows, samples, data, and people to support all aspects of genomics research and clinical application, from the initial patient visit to the physician-guided reporting of genomic findings. [20]

MaPSeq (Massively Parallel Sequencing) is an open source plugin-based Service-Oriented Architecture (SOA) that provides secure management and execution of the complex downstream computational and analytical steps involved in high-throughput genomic sequencing and other data-intensive applications. [21] MaPSeq and its homegrown sister technology, GATE (Grid Access Triage Engine), are built on top of Apache Karaf and together provide extensible capabilities for downstream analysis of genomic data and other large data sets, including workflow pipeline execution and management, meta-scheduling of workflow jobs, opportunistic use of compute resources, secure data transfer, and web-based client access.

CANVAS (CAroliNa Variant Annotation Store) and AnnoBot (Annotation Bot) work together to provide version-controlled annotation and metadata for genomic variant data in order to support up-to-date clinical interpretation of genomic variants and thereby guide clinical decision making. [22] CANVAS is designed as an open source, relational PostgreSQL database that stores genomic variant data with associated annotation and metadata. AnnoBot consists of Python modules and software driver code configured to provide automated monitoring and retrieval of external data sources for annotation updates.

CHAT (Convergent Haplotype Association Tagging) is a software algorithm that allows for the identification of moderately penetrant genomic variants using cross-population genetic structures. CHAT invokes a graph theory–based algorithm to determine the haplotype phase of a population of unrelated individuals by first identifying subsets of individuals that share a region of the genome through descent and then generating a consensus haplotype for the shared region. [23]

The SMW (Secure Medical Workspace) provisions a secure environment for access to sensitive patient data for clinical care or Institutional Review Board–approved clinical research. [24] [25] The open source SMW architecture uses virtualization technology (i.e., VMware) and Data Leakage Protection (DLP) technology (i.e., Websense) to create a secure virtual workspace coupled with the ability to prevent (or allow with a challenge and auditing by Information Technology staff) the physical removal of data from a central, secure storage environment.
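The two steps attributed to CHAT above (grouping individuals who share a region, then forming a consensus haplotype for that region) can be pictured with a small toy in Python. This is only a sketch under strong assumptions and is not the CHAT implementation: it assumes the pairs of individuals who share the region are already known, treats each group as a connected component of that sharing graph, assumes each individual contributes one equal-length allele string for the region, and takes a simple per-site majority vote. The function names `shared_groups` and `consensus_haplotype` are hypothetical.

```python
from collections import Counter, defaultdict


def shared_groups(individuals, sharing_pairs):
    """Group individuals into connected components of the sharing graph.

    sharing_pairs is an iterable of (a, b) tuples meaning that a and b
    are assumed to share the region of interest.
    """
    adjacency = defaultdict(set)
    for a, b in sharing_pairs:
        adjacency[a].add(b)
        adjacency[b].add(a)

    seen = set()
    groups = []
    for start in individuals:
        if start in seen:
            continue
        # Depth-first traversal collects everyone reachable from start.
        stack = [start]
        seen.add(start)
        component = []
        while stack:
            node = stack.pop()
            component.append(node)
            for neighbor in adjacency[node]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
        groups.append(sorted(component))
    return groups


def consensus_haplotype(haplotypes):
    """Return the most common allele at each site across the input strings."""
    consensus = []
    for site_alleles in zip(*haplotypes):
        allele, _count = Counter(site_alleles).most_common(1)[0]
        consensus.append(allele)
    return "".join(consensus)
```

For example, with individuals `a` to `d` and sharing pairs `("a", "b")` and `("b", "c")`, the first function yields one group of three and one singleton; a majority vote over the allele strings of the group of three then gives a single consensus string for the shared region.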

Institutes and consortiums

RENCI pioneered the establishment of a national institute, the WSSI, and two major consortiums, the iRODS Consortium and the NCDS.

WSSI

The NSF-funded WSSI was established in September 2012 as a collaboration between RENCI and SESYNC. The mission of the WSSI is to "enable and accelerate new transformative water science by concurrently transforming both the software culture and the research culture of the water science community." [26] [27] When it is fully operational, the WSSI aims to operate under the Open Community Engagement Model, which will integrate multiple NSF-funded initiatives (Synthesis Centers, Environmental Observatories, Software Sustainability Institutes, etc.) to distill data, ideas, theories, and methods and thereby provide synthetic information to address water science challenges that cannot be addressed using traditional disciplinary methods. The activities of the WSSI focus on the development of an open community and the promotion of open source and agile software development in order to accelerate transformative water science research. In addition to RENCI and SESYNC, current members include the Institute for the Environment at UNC-CH, University of Illinois Urbana-Champaign, University of Michigan, University of Maryland, NCSA, Red Hat, National Oceanic and Atmospheric Administration, and IBM.

NCDS

The NCDS was established by RENCI in February 2013 as a public/private partnership of leading universities, governmental and non-profit agencies, and businesses devoted to advancing data science, which the NCDS defines as "the systematic study of the organization and use of digital data in order to accelerate discovery, improve critical decision-making processes, and enable a data-driven economy." [28] The mission of the NCDS is "to provide the foundation needed to advance data science research, education, and economic opportunity." The NCDS works toward this mission by providing intellectual leadership and hosting numerous workshops, an academic-industry faculty fellowship, a Data Matters Summer Short Course series, student career events, invited talks, and summit meetings. In addition, the NCDS sponsors a Data Observatory, which provides a shared federated infrastructure for data sharing and computing. The NCDS also partners with numerous regional efforts in data science, including Datapalooza, Triangle Open Data Day, Pearl Hacks, Data4Decisions, Analytics Forward UnConference, and others. As of June 2015, the NCDS comprises 15 member organizations, with 8 based in North Carolina and 4 multinational companies with a strong presence in the Research Triangle Park, NC area.

iRODS Consortium

The iRODS Consortium was founded by RENCI in March 2013 and is headquartered at RENCI, as is the main iRODS development team. The mission of the consortium is "to ensure the sustainability of the integrated Rule-Oriented Data System (iRODS) and to further its adoption and continued evolution." [29] To achieve its mission, the consortium works to develop standards for the open source iRODS technology and its future development, promote advancements for the technology, and expand the user base. The consortium also supports the development of a mission-critical, production-level version of iRODS (currently v4.1). The iRODS Consortium includes a diverse membership of iRODS user organizations from around the world. Current consortium members include RENCI, the DICE Centers at UNC-CH and the University of California, San Diego, DataDirect Networks, Seagate Technology, Wellcome Trust Sanger Institute, EMC Corporation (EMC2), IBM, and NASA's Atmospheric Science Data Center.

References

  1. RENCI Website, renci.org.
  2. The University of North Carolina at Chapel Hill. (2004-2010). Annual Financial Reports. Chapel Hill, North Carolina: University of North Carolina at Chapel Hill. Available at: http://research.unc.edu/offices/vice-chancellor/about/annual-reports.
  3. The University of North Carolina at Chapel Hill. (2011-2014). Comprehensive Annual Financial Reports. Chapel Hill, North Carolina: University of North Carolina at Chapel Hill. Available at: http://finance.unc.edu/reports-and-data/financial-statements-archive.
  4. Baldin, I., Ruth, P., Xin, Y., Mandal, A., Chase, J., Tilson, J., & Prasad, S. (2013). Visions of a Future Internet: The ExoGENI Example. RENCI/NCDS, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA. doi:10.7921/G0CC0XM1.
  5. Baldin, I., Xin, Y., Mandal, A., Ruth, P., Heerman, C., & Chase, J. (2012). ExoGENI: A Multi-Domain Infrastructure-as-a-Service Testbed. Proceedings of the 8th International ICST Conference on Testbeds and Research Infrastructures for the Development of Networks and Communities (TridentCom). Available at: https://www.cs.duke.edu/~chase/exogeni.pdf.
  6. Rajasekar, A.; Moore, R.; Hou, C.; Lee, C. A.; Marciano, R.; de Torcy, A.; Wan, M.; Schroeder, W.; Chen, S.; Gilbert, L.; Tooby, P.; Zhu, B. (2010a). "iRODS Primer: Integrated Rule-Oriented Data System". Synthesis Lectures on Information Concepts, Retrieval, and Services. 2 (1): 1–143. doi:10.2200/s00233ed1v01y200912icr012.
  7. Rajasekar, A., Moore, R., Wan, M., Schroeder, W., & Hasan, A. (2010b). Applying Rules as Policies for Large-Scale Data Sharing. Intelligent Systems, Modelling and Simulation (ISMS), 2010 International Conference on Intelligent Systems, Modelling and Simulation. Available at: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5416072.
  8. Schmitt, C., Wilhelmsen, K., Krishnamurthy, A., Ahalt, S. & Fecho, K. (2013b). Security and Privacy in the Era of Big Data: iRODS, a Technological Solution to the Challenge of Implementing Security and Privacy Policies and Procedures. RENCI, University of North Carolina at Chapel Hill. Available at: http://www.renci.org/wp-content/uploads/2014/02/0313WhitePaper-iRODS.pdf.
  9. Fortner, B., Ahalt, S., Coposky, J., Fecho, K., Krishnamurthy, A., Moore, R., Rajasekar, A., Schmitt, C., & Schroeder, W. (2014). Control Your Data: iRODS, the integrated Rule-Oriented Data System. RENCI, University of North Carolina at Chapel Hill. Available at: http://renci.org/wp-content/uploads/2014/07/0214WhitePaper-iRODS-2-FINAL-v6.pdf.
  10. Rajasekar, A., Kum, H., Crosas, M., Crabtree, J., Sankaran, S., Lander, H., Carsey, T., King, G., & Zhan, J. (2013a). The DataBridge. Science Journal, ASE. Available at: http://databridge.web.unc.edu/files/2013/01/DataBridge_Journal_Final.pdf.
  11. Rajasekar, A., Sankaran, S., Lander, H., Carsey, T., Crabtree, J., Kum, H., Crosas, M., King, G., & Zhan, J. (2013b). Sociometric Methods for Relevancy Analysis of Long Tail Science Data. ASE/IEEE International Conference on Big Data. Available at: http://databridge.web.unc.edu/files/2013/01/Databridge-ConferenceVersion-final.pdf.
  12. Crabtree, J. (2013). DataBridge: Building an e-Science Collaboration Environment Tool for Linking Diverse Datasets into a Sociometric Network. IASSIST 2013. Available at: http://databridge.web.unc.edu/files/2013/07/IASSIST2013DataBridgeFinal.ppt.
  13. Luettich, R. A., Westerink, J. J., & Scheffner, N. W. (1992). ADCIRC: an Advanced Three-dimensional Circulation Model for Shelves Coasts and Estuaries, Report 1: Theory and Methodology of ADCIRC-2DDI and ADCIRC-3DL. Dredging Research Program Technical Report DRP-92-6. U.S. Army Waterways Experiment Station, Vicksburg, MS. Available at: http://www.dtic.mil/dtic/tr/fulltext/u2/a261608.pdf.
  14. Westerink, J.; Luettich, R.; Feyen, J.; Atkinson, J.; Dawson, C.; Roberts, H.; Powell, M.; Dunion, J.; Kubatko, E.; Pourtaheri, H. (2008). "A Basin- to Channel-scale Unstructured Grid Hurricane Storm Surge Model Applied to Southern Louisiana". Monthly Weather Review. 136 (3): 833–864. Bibcode:2008MWRv..136..833W. doi:10.1175/2007MWR1946.1.
  15. Caron, B. (2011). EarthCube Governance Whitepaper: Realizing Expectable Returns on EarthCube Investments in Community Building and Democratic Governance. New Media Research Institute, Santa Barbara, CA. Available at: http://semanticommunity.info/@api/deki/files/13792/=004_Caron.pdf.
  16. Gil, Y., Chan, M., Gomez, B., & Caron, B. (2014). EarthCube: Past, Present, and Future. EarthCube Project Report. EC, 3. Available at: http://earthcube.org/file/3616/download?token=bVernkf4.
  17. Tarboton, D. G., Idaszak, R., Horsburgh, J. S., Heard, J., Ames, D., Goodall, J. L., Band, L. E., Merwade, V., Couch, A., Arrigo, J., Hooper, R., Valentine, D., & Maidment, D. (2014a). A Resource Centric Approach for Advancing Collaboration Through Hydrologic Data and Model Sharing. 11th International Conference on Hydroinformatics, HIC 2014, New York City, USA. Available at: http://www.hic2014.org/proceedings/bitstream/handle/123456789/1539/1566.pdf?sequence=1&isAllowed=y.
  18. Tarboton, D. G., Idaszak, R., Horsburgh, J. S., Heard, J., Ames, D., Goodall, J. L., Band, L. E., Merwade, V., Couch, A., Arrigo, J., Hooper, R., Valentine, D., & Maidment, D. (2014b). HydroShare: Advancing Collaboration through Hydrologic Data and Model Sharing. In D. P. Ames, N. W. T. Quinn, and A. E. Rizzoli (eds), Proceedings of the 7th International Congress on Environmental Modelling and Software. San Diego, California: International Environmental Modelling and Software Society (iEMSs). ISBN 978-88-9035-744-2. Available at: http://www.iemss.org/sites/iemss2014/papers/iemss2014_submission_243.pdf.
  19. Heard, J., Tarboton, D., Idaszak, R., Horsburgh, J., Ames, D., Bedig, A., Castronova, A. M., Couch, A., Dash, P., Frisby, C., Gan, T., Goodall, J., Jackson, S., Livingston, S., Maidment, D., Martin, N., Miles, B., Mills, S., Sadler, J., Valentine, D., & Zhao, L. (2014). An Architectural Overview of Hydroshare, A Next-Generation Hydrologic Information System. 11th International Conference on Hydroinformatics, HIC 2014, New York City, USA. Available at: http://www.hic2014.org/proceedings/bitstream/handle/123456789/1536/1562.pdf?sequence=1&isAllowed=y.
  20. Owen, P., Ahalt, S., Berg, J., Coyle, J., Evans, J., Fecho, K., Gillis, D., Schmitt, C., Young, D., & Wilhelmsen, K. (2014). Technologies for Genomic Medicine: The GMW, A Genetic Medical Workflow Engine. RENCI, University of North Carolina at Chapel Hill. doi:10.7921.G0KW5CXC. Available at: http://renci.org/technical-reports/tr-14-02-the-gmw-a-genetic-medical-workflow-engine.
  21. Reilly, J., Ahalt, S., Fecho, K., Jones, C., McGee, J., Roach, J., Schmitt, C., & Wilhelmsen, K. (2014). Technologies for Genomic Medicine: MaPSeq, A Computational and Analytical Workflow Manager for Downstream Genomic Sequencing. RENCI, University of North Carolina at Chapel Hill. doi:10.7921.G0VD6WCF. Available at: http://renci.org/technical-reports/mapseq-computational-and-analytical-workflow-manager.
  22. Bizon, C., Ahalt, S., Fecho, K., Nassar, N., Schmitt, C., Scott, E., & Wilhelmsen, K. (2014). Technologies for Genomic Medicine: CANVAS and AnnoBot, Solutions for Genomic Variant Annotation. RENCI, University of North Carolina at Chapel Hill. doi:10.7921.G0QN64N3. Available at: http://renci.org/technical-reports/tr-14-04-canvas-and-annobot-solutions-for-genomic-variant-annotation.
  23. Webb, A. E. (2011). Linkage, Association, And Haplotype Analysis: A Spectrum Of Approaches To Elucidate The Genetic Influences Of Complex Human Traits (Doctoral dissertation). Retrieved from UNC electronic theses and dissertations collection. (cdm 3992)
  24. Schmitt, C., Shoffner, M., Owen, P., Wang, X., Lamm, B., Mostafa, J., Barker, M., Krishnamurthy, A., Wilhelmsen, K., Ahalt, S., & Fecho, K. (2013a). Security and Privacy in the Era of Big Data: The SMW, a Technological Solution to the Challenge of Data Leakage. RENCI, University of North Carolina at Chapel Hill. Available at: http://www.renci.org/wp-content/uploads/2014/02/0213WhitePaper-SMW.pdf.
  25. Shoffner, M.; Owens, P.; Mostafa, J.; Lamm, B.; Wang, X.; Schmitt, C. P.; Ahalt, S. P. (2014). "The Secure Medical Research Workspace: An IT Infrastructure to Enable Secure Research on Clinical Data". Clinical and Translational Science. 6 (3): 222–225. doi:10.1111/cts.12060. PMC 3682797. PMID 23751029.
  26. Ahalt, S., Minsker, B., Tiemann, M., Band, L., Palmer, M., Idaszak, R., Lenhardt, C., & Whitton, M. (2013). Water Science Software Institute: An Open Source Engagement Process. Proceedings of the 5th International Workshop on Software Engineering for Computational Science and Engineering, 40-47. Available at: http://waters2i2.org/documents/2013/05/water-science-software-institute-an-open-source-engagement-approach.pdf.
  27. Lenhardt, W.C., Ahalt, S., Jones, M., Aukema, J., Hampton, S., Idaszak, R., Rebich-Hespanh, S., & Schildhauer, M. (2014). ISEES-WSSI Lessons for Sustainable Science Software from an Early Career Training Institute on Open Science Synthesis. figshare. Available at: http://files.figshare.com/1796332/ISEES_WSSI_TrainingInst_REV_20141115.pdf.
  28. Ahalt, C. S., Bizon, C., Evans, J., Erlich, Y., Ginsburgh, G. S., Krishnamurthy, A., Lange, L., Maltbie, D., Masys, D., Schmitt, C., & Wilhelmsen, K. (2014). Data to Discovery: Genomes to Health. A White Paper from the National Consortium for Data Science. RENCI, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA. doi:10.7921.G03X84K4. Available at: http://data2discovery.org/dev/wp-content/uploads/2014/02/NCDS-Summit-2013.pdf.
  29. iRODS Consortium. By Laws. December 01, 2014. Available at: http://irods.org/wp-content/uploads/2014/12/iRODS_ByLaws_V1120114.pdf.

Coordinates: 35°56′22″N 79°01′08″W (35.939561°N, 79.018753°W)