Tapping into the Stream of Big Neuroscience Data through Ocean Protocol as a Vessel for Open Science and Reproducibility Principles
Website
Proposal Wallet Address
0x33359285f30e7b3386de70ca500f4fe27853765b
(opscientia.eth)
Current Country of Residence
Opscientia LTD. is a Singapore registered company.
Email address
Twitter Handle
@opscientia
Discord
One-Sentence Summary
I am requesting a continuation for our grant to execute our timeline for bootstrapping an existing community of neuroscientists to begin working with the Ocean Protocol and unleash neuroimaging data.
Categories Describing the Project
[1] Outreach / community / spread awareness (grants don’t need to be technical in nature)
Funding Amount Requested
40.000 $OCEAN / $17.600
Remaining Treasury Balance
Mission Coral has received 51.042 $OCEAN from previous funding rounds (III,IV,V), a total of 501 $OCEAN remains in our Treasury.
Which Fundamental Metric best describes your project? Pick one.
- Network Revenue
- $ Proposed / $ Funded (Your project helps generate, and fund more proposals)
- Other.
Our proposal is bootstrapping a Science Data DAO that will generate datasets and algorithms to host on our fork of the Ocean Marketplace and will provide templates for other groups to replicate our efforts leading to new future proposals.
Project Overview
The field of neuroscience has generated an ocean of data, estimated to be in excess of an exabyte, collected from brain imaging studies of fish, rodents, primates, and humans. Most of this data remains siloed in individual labs - often in proprietary formats and inaccessible to the public. Recent movements in the neuroimaging community have embraced open science principles and a trickling stream of curated datasets has begun to flow onto public internet forums, making detailed measurements of the brain available in great detail for the first time.
Problem 1: Educating Neuroscientists about the Decentralized Web and Data Economy through Ocean Protocol
A major obstacle preventing this trickle from cascading into a torrent is the absence of incentives to share expensive and meticulously collected data on public platforms where others may take credit or abuse the data. The Ocean protocol has taken steps to remove this obstacle with the introduction of data tokens, marketplaces, and confidential cloud computing. However, the concepts of tokenization, blockchain, and the potential of a distributed web 3.0 have yet to penetrate the neuroscience community.
Solution 1: Educational Events, Materials and Data Bounties for Neuroscientists
I am requesting a grant to bootstrap an existing community of neuroscientists to begin working with the Ocean Protocol to unleash neuroimaging data.
The grant will make possible the creation of educational materials, monitored community forums, virtual workshops, and hack-a-thons, in addition to providing data and developer bounties. These goals are set with the purpose of onboarding neuroscientists onto the data economy through the Ocean Protocol.
This grant will bootstrap an existing community of neuroscience data engineers, researchers, and software developers through the International Neuroinformatics Coordinating Facility (INCF), Organization for Human Brain Mapping (OHBM), Georgetown University Methods Lab, and the Global Brain Hack.
We have assembled a list of 800+ research laboratories and neurotech companies and plan to survey current research data management needs and the intersection of priorities with the value proposition of the Ocean Protocol.
Problem 2: Addressing Core Research Questions for Storing and Retrieving Data on Web 3.0
A remaining challenge for neuroscientists is archiving and sharing massive versioned datasets, particularly those that are composed of many sub-datasets being served by multiple groups across a variety of storage locations (institutional computing centers, cloud service providers, individual lab systems). Opscientia’s team is currently exploring the capabilities for serving massive (250TB+) multi-scale images of the brain using decentralized file storage with the Textile Powergate in integration with Ocean Protocol.
Solution 2: Development of Decentralized Science Stack for Performance Testing Web 3.0 Solutions for Scientific Research Data Management
In this project, we will perform use-case testing deploying data via Textile to Filecoin as a file storage protocol serving large datasets indexed by the metadata aggregator datalad to be archived, and later visualized by researchers across an HTTPS gateway using in-browser rendering applications (i.e., neuroglancer). Metadata will be used to generate tokenized datasets on Ocean Market using Textile/Ocean integration.
Research Questions
Large neuroscience datasets require specialized workflows for breaking datasets down into storage-efficient chunks, ensuring persistent availability, validation of interoperability specifications, and managing metadata for linked resources.
The Opscientia team will test the following questions starting with small datasets for unit-tests then building up to 284TB of mixed content datasets indexed by the metadata aggregator datalad.
What is the Expected ROI?
Simple educational events will introduce neuroscientists to existing resources for unleashing their data to share and build upon with colleagues. Over time and as interest builds around the world, adoption is expected to snowball. Likewise, consumption of data tokens will be a trickle at first as we develop pipelines for tokenizing neuroscience datasets on the blockchain.
This grant will also generate critical feedback for the Ocean Protocol to understand current data needs and problems from scientists’ perspectives. Early ROI will likely be low as hidden barriers to adoption are revealed, but a large ROI > 1 can be expected as up to 20k academic neuroscience data providers around the world become activated as consumers and contributors to the Ocean Protocol.
We will establish quantitative metrics for hidden barriers and attitudes towards data markets, Web 3.0, and the Ocean Protocol’s services by deploying user research materials, such as workshops and surveys, with members in the neuroimaging community. This will help us establish a market segment of potential contributors to the data economy and more accurately predict future ROI.
We are projecting a short-term goal for hosting 1 neuroimaging dataset on Ocean Marketplace and up to 10 models for consumption on compute-to-data by the end of our outreach roadmap to demonstrate proof-of-concept for the neuroimaging community. We can model the ROI of future Unleash Data campaigns based on the success of these pools.
The team has already led 5+ hack-a-thon projects themed around the neuroscience data economy and has received a warm reception from the neuroscience community! We also recently received a finalist prize at the May EthGlobal Hackathon and second place in DoraHacks for our work on the Open Science Data Wallet.
ROI: buck
If awarded, this project would have received 83.042 $OCEAN in grant funds (buck). Our current deliverables of ROI feeding back into the system include growing a Data DAO (63 total community members, ~12 active contributors) running on Ocean Marketplace, supporting hack-a-thon teams building on Ocean, providing mentorship to summer undergraduate research fellows working on Ocean Protocol projects, and engaging with scientists at scientific conferences.
ROI: bang
We expect our efforts will provide a template for other teams of researchers and scientists looking to build on Ocean. If we capture 10% of the 800+ neurotech labs we have identifed, we can expect them to follow our template to unleash their data. The average cost of neuroimaging dataset is ~$800 USD per participant. A research laboratory collects upwards of 100 participants per project (lower bound based on typical statistical power required for neuroimaging sample size analyses). We can expect each dataset to be worth 133.333 $OCEAN at current prices. For example, if 10% of the 800+ labs identified follow our template to contribute data to the Ocean Market, we can expect a total value of 10.666.640 $OCEAN (bang) in neuroimaging data staked on the market place.
The hypothetical ROI following this model results in a value of >200 (bang/buck) with a 100% chance of success, >100 with a 50% chance, >40 with a 20% chance, >10 with a 10% chance, >1 with a 1% chance. We believe the chance of success of the realized outcome described above grows significantly based on when in time success is assessed, specifically increasing significantly towards the end of the grant’s roadmap for deliverables.
ROI: [bang / buck] x p(success)
Therefore, bang for buck for this proposal can only stand to disproportionately benefit the Ocean DAO
Grant Deliverables
This grant supports the following deliverables:
- Conference Workshops & Seminars
- Google summer of code projects on the data economy
- Decentralized storage of versioned BIDS datasets with IPFS, datalad and Ceramic
- Securing BIDS Neuroimaging Datasets Stored on Decentralized File Sharing Networks with Self-Sovereign DIDs and IPFS
-
Hack-a-Thons on Data Interoperability Standards for Web 3.0 and ETHGlobal Hack-a-Thon for Opscientia’s Open Science Data Wallet
-
Corporate partnerships with Neuroinformatics Consortia and to support development of open source toolkits for the neuroscience community
-
Survey and quantitative metrics regarding attitudes toward the Data Economy, Ocean Academy, and Web 3.0
-
Providing initial liquidity to publish the first neuroimaging dataset pool and applications for compute-to-data
-
Content in the form of videos, code walk throughs, articles, opinion pieces hosted on the OpscientiaDAO blog
-
Stipend supplements and sponsorships for fellows and scholars contributing to decentralized science open source development, such as the Opscientia Open Web Fellowship
-
Reports on Tested Schemas for Decentralized File Storage with Ocean Market Data Tokenization
Roadmap
- March 1-9 — Google Summer of Code: Student/Mentor Q&A
- April 19 - 23 — Presentation at the Neuroinformatics Assembly
- April 14 - May 2 — Google Summer of Code: Student/Mentor Matching
- May 14 — Kernel Block III Fellowships for BUIDL Track
- May 17 — Google Summer of Code Onboarding
- June 21-25 — Organization for Human Brain Mapping
- June 7 - August 17 — Google Summer of Code: Open Hacking
- August 15 — Neurotech survey respondents, first wave
- August 31 — First draft of educational materials to be hosted on https://www.opscientia.com/learn/
educational materials pushed back in timeline as we conduct user testing and build platform
All workshop materials, meetings, and code will be made publicly accessible on Github . Meeting events, topic posts, and tutorials will be hosted on medium . These events may be run every 4-6 weeks or more frequently as funding and open interest allow.
Opscientia is a company providing onboarding services and software infrastructure for launching decentralized autonomous organizations on distributed cloud public networks and smart contract blockchains. The team lead is an active member and contributor of the INCF, OHBM, and Brainhack Community.
Core Team Members
Shady El Damaty , M.Sc., Ph.D.
- Role : Cognitive Neuroscientist, Project Lead
- Github : github.com/seldamat
- info on projects at : seldamat.github.io
- Linkedin : linkedin.com/in/seldamat/
Sarah Hamburg , M.S., Ph.D.
- Role : Cognitive Neuroscientist
- Github : https://github.com/shamburgularara
- Linkedin : https://www.linkedin.com/in/sarah-hamburg-phd-9510a910a
Daniel Byington , M.Sc., M.B.A.
- Role : Business Development Specialist
- Linkedin : https://www.linkedin.com/in/daniel-byington-964a35a4/
Liliana Muscarella , M.A.
- Role : Global Partnerships Liason
- Linkedin : https://www.linkedin.com/in/lilianamuscarella/
Miri Rossitto , Founder @ COWE Communications
- Role : Brand Strategist
- Linkedin : https://www.linkedin.com/in/rossitto/
Alexandra McCarroll , M.Sc. (in proc)
- Role : Software & Data Engineer
- Linkedin: https://www.linkedin.com/in/alexandra-mccarroll-469108133/
- Github: https://github.com/XandraMcC
Kinshuk Kashyap , Summer Fellow
- Role : Software Engineer, Google Summer of Code Scholar
- Linkedin : https://www.linkedin.com/in/kinshuk-kashyap-32a4747b/
- Github : https://github.com/kinshukk
Achintya Kumar , Summer Fellow
- Role : Software Engineer, Opscientia Open Web Fellowship
- Linkedin: https://www.linkedin.com/in/achintya-kumar1/
- Github: https://github.com/Ackintya
Additional Information
This grant is a first step in building a decentralized neuroscience community running self-governed, owned, and automated science activities on-chain --> Opscientia DAO