The CIS Data Science Practice offers services related to acquiring, managing, and integrating data sources for research, and performing data and text mining. On the management side, we can help with data transfers; extract, transform, load (ETL) processes; data warehousing, and integration of disparate data; platforms for storing and accessing both small data and big data; and optimizing data access. On the analysis side, we can help with methods for mining structured data from unstructured data and for textual analysis.
We provide more specialized services for bioinformatics and computational biology through the Computational Biology Core, which is located in the Data Science Practice.
In all of these areas, we apply industry best-practices for software engineering to build robust pipelines at scale. We also provide release engineering for open-source software to help researchers disseminate their methods, publish reproducible results, and promote open science.
Request this Service