Evotegra: Finish Dataset Extension & Kickoff Collector Union

Name of Project: Extend Dataset & Data Economy Poll

Proposal in one sentence

Extend the round 8 promotional 100k traffic dataset with annotations for licence plates and faces to enable the creation and evaluation of automated anonymization algorithms and kickstart the evaluation of technical prerequisites leading to a Smart Collector Union.

Description of the project and what problem is it solving:

In Ocean DAO Round 8 we committed to create a professional traffic image dataset with at least 100.000 images in ~300 categories and an estimated million objects. The dataset will soon be published on the Ocean Market with an initial data token price of 1 Ocean. The idea is to create promotional datasets that help to establish Ocean Market as a prime source of high quality data.
We propose to extend this promotional dataset next the promised 300 classes with annotations for faces and license plates. By adding these 2 classes we add significant extra value to the dataset which then can be used to train and evaluate automated anonymization algorithms which is an important requirement to assure compliance with privacy regulation such as GDPR.
Our goal is to finish the dataset extension in R14.
Next to creating new data we propose to start the evaluation of technical requirements leading to a Collector Union. A Collector Union consists of a swarm of AI enabled embedded systems that collectively accumulate data that has been inspected by an AI running on the embedded system. The purpose of the AI is to evaluate the data in real time and hence to filter all irrelevant or redundant data very early in the collection process. With this grant we want to evaluate the possibility to manage AI enabled devices in a Kubernetes Cluster to dynamically configure the detection of a swarm of connected devices at runtime.

Category

Unleash data

Metric

Data Consume Volume

What is the final product?

  1. The final product is an extension of the promotional image dataset proposed in DAO round 8 with 2 additional classes “license plates” and “faces”.
  2. Publish the results of the evaluation which are a important milestone in the creation of Collector Unions

Grant deliverables

Round 8 - Requested $10k. All Funding Deployed. Publish carry over to R11

[x] Collect 100.000 images from diverse road scenes in Germany (~2-3 month)
[x] Annotate the images in ~300 classes (~1 month)
[x] Publish the dataset in a pool for the initial price of 1 Ocean (~September 2021)
Could not be published yet. Issue reported but won’t be fixed before OceanV4.

Round 11 - Requested $3k. $3k + R8 publish carry over to R12.

[x] Extend the promotional 100k dataset with face annotations
[x] Extend the promotional 100k dataset with license plate annotations
[x] Provide 5000 annotated images

Round 12 - Requested $3k. $3k deployed from carry over. 3K + R8 publish carry over to R13.

[x] Extend the promotional 100k dataset with face annotations
[x] Extend the promotional 100k dataset with license plate annotations
[x] Provide 10000 annotated images

Round 13 - Requested $9.5k. $3k deployed from carry over. 6k deployed. 3K + R8 publish carry over to R14

[x] Extend the promotional 100k dataset with face annotations
[x] Extend the promotional 100k dataset with license plate annotations
[x] Provide additional 10000 annotated images
[x] Conduct a data economy poll along with German AI association members
[x] Present results of the poll to data consume and parameters working group

Round 14

  1. Extend the promotional 100k dataset with face annotations
  2. Extend the promotional 100k dataset with license plate annotations
  3. Provide additional 65000 annotated images. The results is a fully extended R8 dataset
  4. Publish a github repo with instructions on how to setup Kubernetes on AI enabled embedded systems.

R8 and R13 Deliverables

The attempt to to publish the R8 dataset on Ocean Market was unsuccessful. Due to lack of resources within Ocean Core the issue is unfortunately not going to be fixed prior to the Ocean V4 release. Therefore we put the publishing on hold until Ocean V4 release.
We finished all our R12 deliverables and all R13 annotations and we will provide the poll for the German AI association in addition to our R14 grant.

Funding Requested:

9.5k USD

ROI

We see this effort as a starting point to add other promotional datasets. Our goal is to establish Ocean as a prime source for high quality data. The attractive entry price should encourage many first time users to get familiar with wallets, Ocean tokens and the Ocean Market itself. By enabling new use cases to train and test automated anonymization algorithms this will notably increase the value of the data.
Smart Collector Unions presents a ultra scalable solution to create high quality data at unprecedented costs. Using Ocean Market as a platform to monetize both data and algorithms Smart Collector Unions present a new form of community owned data economy solutions based on Ocean Protocol.

Proposal Wallet Address:

0x61B15998893cC746B46C08FEdEE13a0d1b33bBa9

Team

Web: www.evotegra.de
Email: manthey@evotegra.de
Twitter: @evotegra
Country of Residence: Germany

Previous OceanDAO Grant?

Yes

Thank you for submitting your proposal and updates@tmanthey, I have now updated Round 13 + submitted your proposal to Round 14. They are in good standing, and this proposal has been accepted into R14.

For posterity, could you please reply to your Round 13 proposal with the R13 deliverable update you submitted in here?

All the best!

1 Like

ok. I updated R13 with a reply.

1 Like

Hi @tmanthey,

Thank you for submitting your proposal for R-14!

I am a Project-Guiding Member and have assigned myself to help you.

I have reviewed your proposal and would like to thank you for your participation inside of the Ocean Ecosystem!

Your project looks promising and I believe it’s aligned with our evaluation criteria of generating positive value towards the Ocean Ecosystem and the W3SL.

The project criteria are:

  1. Usage of Ocean — how well might the project drive usage of Ocean? The Evotegra team have been focussed on building a high quality dataset inside the Ocean ecosystem for some time and if they are able to showcase the benefits of Ocean through the work they are doing with Evotegra it could attract many more large, quality datasets to the Ocean ecosystem.
  2. Viability — what is the chance of success of the project? The dataset itself has market value so n that sense it is already successful.
  3. Community active-ness — how active is the team in the community? @tmanthey is very active in the Ocean ecosystem and has been for quite some time.
  4. Adding value to the community — how well does the outcome of the project add value to the overall Ocean community / ecosystem? As mentioned earlier, if this project can act as a successful use case inside the Ocean ecosystem of how large, quality datasets can drive value inside the ecosystem and the benefits of that can be shared with owners/publishers of other large, quality data sets and attract them, then the overall value the Evotegra team can add to Ocean is very high.

Based on the reasons above, I am in support of your project and proposal. I look forward to continuing providing support and feedback to your project.

All the best!

-Your PGWG Guide
Scott__Sigil

3 Likes

Thank you for the review @Scotty, and Tobias for continuing to grow and improve the Evotegra dataset.