Coral Market: A self-sovereign back-end for research data-management powered by Ocean Protocol
Part 1 - Proposal Submission
Name of Project
Proposal in one sentence
Grant funds are requested to support the development of an open-source application for GDPR-compliant self-sovereign scientific data management and peer-to-peer sharing.
Description of the project and what problem is it solving
Thousands of petabytes of data on human health, economic activity, social dynamics, and scientific observations of the universe and our impact on it is siloed in legacy institutional web infrastructure.
A key challenge for unlocking scientific data is managing controlled access to sensitive data, such as:
- Personal data provided by a participant in a research study
- Sensitive data that can be used adversarially or for harm
- Proprietary data collected at great expense by an investigator
The emergence of peer-to-peer data storage and standards for decentralised identifiers (DIDs) makes it possible to permanently establish a public records archive in a common web infrastructure accessible to all, regardless of professional/academic status, nationality, language, or age, while respecting the intrinsic self-sovereignty of data providers. Furthermore, 250 TB’s of data will be free to store on Filecoin, which will accelerate the attractiveness of decentralised storage for scientists to use the Web3 ecosystem. Scientific innovation is one of the most important public goods that need critical funding and the adoption of Web3 by scientists is fundamental if we are to develop an ecosystem with public goods at the forefront of how we build a better web.
This grant will support the development of the middleware for decentralised data storage of large, scientific datasets. This will act as the machine for the Opsciverse which will allow for unified permissions management and peer-to-peer sharing of scientific data, directly lowering the barrier to deployment on Ocean markets. The application will be powered by the Interplanetary File System (IPFS) and Filecoin for content-addressable peer-to-peer data storage, the Ocean Market middleware including but not limited to: Aquarius, Ocean JS lib, PY lib, and Ocean Provider.
- [ ] Research data object self-descriptions and mapping to Ocean Protocol Data Token Metadata
- [ ] Architectural specification, diagram, and technical documentation for Coral Market Application Layer
- [ ] Develop best practices for engineers to onboard onto Web3 Big Data projects
- [ ] Publish science-friendly onboarding documentation onto Web3 marketplaces
- [ ] User Research + User Requirement Design Feedback from Scientists
- [ ] Technical research reports on the integration of Filecoin Storage Auctions with Open Science Data aggregators to provide low-cost storage options for decentralized laboratories
- [ ] Rebranded Ocean Market fork on Ethereum / Polygon testnet
Which category best describes your project?
Build/improve applications or integrations to Ocean
Which Fundamental Metric best describes your project?
Other - Our primary goal is to spread the adoption of Web3 technology amongst the scientific community. To reflect this goal, we propose the number of scientific research objects that are created on our platform as a new metric. These projects will be composed of datasets, algorithms, models, and other scientific digital objects, some of which are data tokens. We seek to replicate the OceanDAO model for funding scientific projects pre-registered on our platform. Our product directly influences Ocean’s ROI through the Ocean Marketplace because our data objects will utilise the ocean ecosystem.
What is the final product?
The Coral Market will be a fully functioning Ocean Market fork with an intrinsic token that is used to create data tokens that correspond to digital research objects such as datasets, pre-computed weights for ML, scientific protocols/experiments. Our team will expand the data token metadata specification to include compatibility with a wide set of scientific self-descriptions following the Open Science Framework. The final product will allow researchers all over the world to pre-register their hypotheses, tokenise their intellectual assets,
How does this project drive value to the “fundamental metric” (listed above) and the overall Ocean ecosystem?
The Coral Market is a critical back-end component of the “Opsciverse” - a one-stop-shop for scientists to access cloud services for reproducible open science with big data sets. We will use this time and money to develop comprehensive research and specifications for the building we will start in October.
As part of our grant deliverables, we will generate critical feedback for the OceanDAO to understand current data needs and problems from scientists’ perspectives. We currently have a research agreement with Textile, Filecoin and MIT/Dartmouth to identify decentralized file storage challenges and solutions for big data neuroscience laboratories. We are building tools to migrate up to 250TB of neuroscience data unto decentralized file storage, and making this open data directly available to over 20,000 neuroscience researchers around the world. We expect exponential growth of research projects over a period of 5 years beginning in Q1 2022 as we tailor cloud service needs to researchers to scale adoption.
If awarded, this project will have received a total of 22564 $OCEAN. Our current deliverables for this round include publishing the product specifications of Coral Market for community review, establishing data bridges with scientific research institutions (MIT/Dartmouth), conducting user research with potential scientific data providers and consumers.
We expect our efforts will provide a template for other teams of researchers and scientists looking to build on Ocean. If we capture 10% of the 800+ neurotech labs we have identified, we can expect them to follow our template to unleash their data. Each research project will include multiple data token objects such as data, models, protocols etc. An example of the average cost of gathering a neuroimaging dataset is ~$800 USD per participant. A research laboratory collects upwards of 100 participants per project (lower bound based on typical statistical power required for neuroimaging sample size analyses). We can expect each dataset to be worth 133.333 $OCEAN at current prices. For example, if 10% of the 800+ labs identified follow our template to contribute data to the Ocean Market, we can expect a total value of 10.666.640 $OCEAN (bang) in neuroimaging data staked on the marketplace.
The hypothetical ROI following this model results in a value of >200 (bang/buck) with a 100% chance of success, >100 with a 50% chance, >40 with a 20% chance, >10 with a 10% chance, >1 with a 1% chance. We believe the chance of success of the realized outcome described above grows significantly based on when in time success is assessed, specifically increasing significantly towards the end of the grant’s roadmap for deliverables.
ROI: [bang / buck] x p(success)
Therefore, bang for buck for this proposal can only stand to benefit the Ocean DAO.
Proposal Wallet Address
Have you previously received an OceanDAO Grant?
Current Country of Residence
Opscientia LTD. is a Singapore registered company.
Part 2: Team
Shady El Damaty , M.Sc., Ph.D.
- Role: Cognitive Neuroscientist, Project Lead, Opscientia Founder
- Github: https://github.com/seldamat
- Website: https://seldamat.github.io
- Linkedin: https://www.linkedin.com/in/seldamat/
- Past Experience: Neuroscientist & Big Data Engineer at Georgetown University
Sarah Hamburg , M.Sc., Ph.D.
- Role: Cognitive Neuroscientist, Project Strategist, Opscientia Co-founder
- Github: https://github.com/shamburgularara
- Linkedin: https://www.linkedin.com/in/sarah-hamburg-phd-9510a910a
- Past Experience: Humanisation Lead at JPMorgan Chase & Co. Technology; Next Generation Technology Consultant at Capco
Alexandra McCarroll , M.Sc. (in proc)
- Role: Software & Data Engineer, Opscientia Co-founder
- Github: https://github.com/XandraMcC
- Linkedin: https://www.linkedin.com/in/alexandra-mccarroll-469108133/
- Past Experience: Data Engineer at HSBC Data and Innovation Lab; Full-stack developer & Consultant at Capco
Liliana Muscarella , M.A.
- Role: Social Impact Strategist
- Linkedin: https://www.linkedin.com/in/lilianamuscarella/ 1
- Past Experience: Human Rights Innovator & Researcher
Kinshuk Kashyap, Fellow
- Role: Software Engineer
- Github: https://github.com/kinshukk
- Linkedin: https://www.linkedin.com/in/kinshuk-kashyap-32a4747b/
- Past Experience: Google Summer of Code Scholar
Achintya Kumar, Fellow
- Role: Software Engineer
- Github: https://github.com/Ackintya
- Linkedin: https://www.linkedin.com/in/achintya-kumar1/
- Past Experience: Opscientia Open Web Fellowship
Caleb Tuttle, Fellow
- Role: Software Engineer
- Github: https://github.com/calebtuttle
- Website: https://calebtuttle.github.io
- Linkedin: https://www.linkedin.com/in/caleb-tuttle-20bbb2126/
- Past Experience: Software Engineer at Startup, TaxSlayer
Part 3: Proposal Details
Project Deliverables - Category
- The app will be live, at: https://market.opsci.io
- The project is open-source and can be found (with a permissive license if necessary) at: https://github.com/opscientia
We will be forking Ocean Market, utilising the current middleware. We will also develop documentation to ensure industry best practices are uniform throughout development. The main aim of this project is to research and architect the backend of the “Opsciverse” and carry out user research in order to develop accurate user requirements for our build phase.
Qualitative user research will be conducted with 5+ scientists.
Social media (Twitter & Linkedin) will be used to disseminate our message and engage with the wider community.
Project Deliverables - Roadmap
Any prior work completed thus far?
- Preliminary community outreach has been completed thus far.
What is the project roadmap?
This grant will bootstrap the September phase of our development lifecycle. Articles will be published to the news page of our website to update the community on our happenings.
Team’s future plans and intentions
We plan to request funding from OceanDAO to complete our product development pipeline from September to December. We will post updates and links to deliverables for community feedback at the end of each month. Our goal is to build the critical infrastructure for our Open Science research platform, starting with the Web3 back-end.
This grant is a first step in building a decentralised science platform running self-governed, owned, and automated science activities on-chain.