Tensor.Rocks 🔥

Achieve the same prediction quality with 50% less data labeling costs

We are making an active learning platform, so you can label less data while achieving the same prediction quality.

What is Active Learning?

Active learning is a way to reduce data labeling costs during the model training stage. To do so one should label more unusual, diverse data. We do this iteratively.

First, we need the pool of unlabelled data. It can be 🖼️ images, 📑 texts, 🎵 audio, 🎬 video and practically any type of data.

Then, a seed dataset is created. We sample a small portion (up to 100 examples) of the most diverse data points and send them to labelers.

After that, the active learning begins:

  1. Model is trained on labeled data.

    ⬇️

  1. A trained model is used to score the pool of unlabelled samples to find a batch of the most informative ones.

    ⬇️

  1. The batch is sent to labelers.

    ⬇️

  1. A labeled batch is added to the labeled data pool.

    🔃

  1. Repeat from step 1 until the desired quality is achieved.

In practice, it allows us to label 50% of data to achieve 99.9% of the SOTA quality.

Features

Follow the development

Tensor.Rocks
tensor-rocks doesn't have any pinned public repositories yet. Repositories This organization has no public repositories. People This organization has no public members. You must be a member to see who's a part of this organization. Most used topics Loading... You can't perform that action at this time.
https://github.com/tensor-rocks