List of manual image annotation tools

Last updated

Manual image annotation is the process of manually defining regions in an image and creating a textual description of those regions. Such annotations can for instance be used to train machine learning algorithms for computer vision applications.

This is a list of computer software which can be used for manual annotation of images.

SoftwareDescriptionPlatformLicenseReferences
Computer Vision Annotation Tool (CVAT)Computer Vision Annotation Tool (CVAT) is a free, open source, web-based annotation tool which helps to label video and images for computer vision algorithms. CVAT has many powerful features: interpolation of bounding boxes between key frames, automatic annotation using TensorFlow OD API and deep learning models in Intel OpenVINO IR format, shortcuts for most of critical actions, dashboard with a list of annotation tasks, LDAP and basic authorizations, etc. It was created for and used by a professional data annotation team. UX and UI were optimized especially for computer vision annotation tasks.JavaScript, HTML, CSS, Python, Django MIT License [1] [2] [3]
LabelMe Online annotation tool to build image databases for computer vision research.Perl, JavaScript, HTML, CSS [4] MIT License
Encord Encord is an automated annotation platform for AI-assisted image annotation, video annotation, and dataset management.
  • Data Management: Compile your raw data into curated datasets, organize datasets into folders, and send datasets for labeling. AI-assisted Labeling: Automate 97% of your annotations with 99% accuracy using auto-annotation features powered by Meta's Segment Anything Model or GPT-4’s LLaVA. Collaboration: Integrate human-in-the-loop seamlessly with customized Workflows - create workflows with the no-code drag and drop builder to fit your data ops & ML pipelines.
  • Quality Assurance: Robust annotator management & QA workflows to track annotator performance and increase label quality. Integrated Data Labeling Services for all Industries: outsource your labeling tasks to an expert workforce of vetted, trained and specialized annotators to help you scale.
  • Video Labeling Tool: provides the same support for video annotation. One of the leading video annotation tools with positive customer reviews, providing automated video annotations without frame rate errors.
    Robust Security Functionality: label audit trails, encryption, FDA, CE Compliance, and HIPAA compliance.
  • Integrations: Advanced Python SDK and API access (+ easy export into JSON and COCO formats).
Python, JavaScript, HTML, CSS [5] Apache-2.0 License
TagLab Desktop open source interactive software system for facilitating the precise annotation of benthic species in orthophoto of the bottom of the sea.Python [6] GPL [7] [8]
VoTT (Visual Object Tagging Tool)Free and open source electron app for image annotation and labeling developed by Microsoft. TypeScript/Electron (Windows, Linux, macOS) MIT License [9] [10] [11] [12] [13] [14]

Related Research Articles

<span class="mw-page-title-main">GIMP</span> Open source raster graphics editor

GNU Image Manipulation Program, commonly known by its acronym GIMP, is a free and open-source raster graphics editor used for image manipulation (retouching) and image editing, free-form drawing, transcoding between different image file formats, and more specialized tasks. It is extensible by means of plugins, and scriptable. It is not designed to be used for drawing, though some artists and creators have used it in this way.

<span class="mw-page-title-main">Git</span> Software for version control of files

Git is a distributed version control system that tracks changes in any set of computer files, usually used for coordinating work among programmers who are collaboratively developing source code during software development.

<span class="mw-page-title-main">OpenCV</span> Computer vision library

OpenCV is a library of programming functions mainly for real-time computer vision. Originally developed by Intel, it was later supported by Willow Garage, then Itseez. The library is cross-platform and licensed as free and open-source software under Apache License 2. Starting in 2011, OpenCV features GPU acceleration for real-time operations.

<span class="mw-page-title-main">Markdown</span> Plain text markup language

Markdown is a lightweight markup language for creating formatted text using a plain-text editor. John Gruber and Aaron Swartz created Markdown in 2004 as a markup language that is intended to be easy to read in its source code form. Markdown is widely used for blogging and instant messaging, and also used elsewhere in online forums, collaborative software, documentation pages, and readme files.

LabelMe is a project created by the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL) which provides a dataset of digital images with annotations. The dataset is dynamic, free to use, and open to public contribution. The most applicable use of LabelMe is in computer vision research. As of October 31, 2010, LabelMe has 187,240 images, 62,197 annotated images, and 658,992 labeled objects.

CellCognition is a free open-source computational framework for quantitative analysis of high-throughput fluorescence microscopy (time-lapse) images in the field of bioimage informatics and systems microscopy. The CellCognition framework uses image processing, computer vision and machine learning techniques for single-cell tracking and classification of cell morphologies. This enables measurements of temporal progression of cell phases, modeling of cellular dynamics and generation of phenotype map.

Eclipse Deeplearning4j is a programming library written in Java for the Java virtual machine (JVM). It is a framework with wide support for deep learning algorithms. Deeplearning4j includes implementations of the restricted Boltzmann machine, deep belief net, deep autoencoder, stacked denoising autoencoder and recursive neural tensor network, word2vec, doc2vec, and GloVe. These algorithms all include distributed parallel versions that integrate with Apache Hadoop and Spark.

Perforce Software, Inc. is an American developer of software used for developing and running applications, including version control software, web-based repository management, developer collaboration, application lifecycle management, web application servers, debugging tools and agile planning software.

<span class="mw-page-title-main">KDE Gear</span> Set of applications and supporting libraries

The KDE Gear is a set of applications and supporting libraries that are developed by the KDE community, primarily used on Linux-based operating systems but mostly multiplatform, and released on a common release schedule.

<span class="mw-page-title-main">Art of Illusion</span>

Art of Illusion is a free software, and open source software package for making 3D graphics.

JPEG XL is a royalty-free raster-graphics file format that supports both lossy and lossless compression. It is designed to outperform existing raster formats and thus become their universal replacement.

<span class="mw-page-title-main">Project Jupyter</span> Open source data science software

Project Jupyter is a project to develop open-source software, open standards, and services for interactive computing across multiple programming languages.

In computer vision, SqueezeNet is the name of a deep neural network for image classification that was released in 2016. SqueezeNet was developed by researchers at DeepScale, University of California, Berkeley, and Stanford University. In designing SqueezeNet, the authors' goal was to create a smaller neural network with fewer parameters while achieving competitive accuracy.

Microsoft, a technology company historically known for its opposition to the open source software paradigm, turned to embrace the approach in the 2010s. From the 1970s through 2000s under CEOs Bill Gates and Steve Ballmer, Microsoft viewed the community creation and sharing of communal code, later to be known as free and open source software, as a threat to its business, and both executives spoke negatively against it. In the 2010s, as the industry turned towards cloud, embedded, and mobile computing—technologies powered by open source advances—CEO Satya Nadella led Microsoft towards open source adoption although Microsoft's traditional Windows business continued to grow throughout this period generating revenues of 26.8 billion in the third quarter of 2018, while Microsoft's Azure cloud revenues nearly doubled.

<span class="mw-page-title-main">Computer Vision Annotation Tool</span> Free and open source, web-based image and video annotation tool

Computer Vision Annotation Tool (CVAT) is a free, open source, web-based image and video annotation tool which is used for labeling data for computer vision algorithms. Originally developed by Intel, CVAT is designed for use by a professional data annotation team, with a user interface optimized for computer vision annotation tasks.

<span class="mw-page-title-main">Foliate (software)</span> E-book reading application for Linux

Foliate is a free e-book reading application for desktop Linux systems. The name refers to leaves, meaning "(getting) leafy" or "…-leaved".

VoTT is a free and open source Electron app for image annotation and labeling developed by Microsoft. The software is written in the TypeScript programming language and used for building end-to-end object detection models from image and videos assets for computer vision algorithms.

Medical open network for AI (MONAI) is an open-source, community-supported framework for Deep learning (DL) in healthcare imaging. MONAI provides a collection of domain-optimized implementations of various DL algorithms and utilities specifically designed for medical imaging tasks. MONAI is used in research and industry, aiding the development of various medical imaging applications, including image segmentation, image classification, image registration, and image generation.

References

  1. "Intel open-sources CVAT, a toolkit for data labeling". VentureBeat. 2019-03-05. Retrieved 2019-03-09.
  2. "Computer Vision Annotation Tool: A Universal Approach to Data Annotation". software.intel.com. 2019-03-01. Retrieved 2019-03-09.
  3. "Computer Vision Annotation Tool (CVAT) source code on github". GitHub . Retrieved 3 March 2019.
  4. "LabelMe Source". GitHub . Retrieved 26 January 2017.
  5. "Encord Source". Documentation . Retrieved 26 January 2017.
  6. "TagLab Source". GitHub . Retrieved 5 July 2023.
  7. Pavoni, Gaia; Corsini, Massimiliano; Ponchio, Federico; Muntoni, Alessandro; Edwards, Clinton; Pedersen, Nicole; Sandin, Stuart; Cignoni, Paolo (2022). "TagLab: AI-assisted annotation for the fast and accurate semantic segmentation of coral reef orthoimages". Journal of Field Robotics. 39 (3): 246–262. doi:10.1002/rob.22049. S2CID   244648241.
  8. Costa, Bryan; Sweeney, Edward; Mendez, Arnold (October 2022). "Leveraging Artificial Intelligence to Annotate Marine Benthic Species and Habitats". Noaa Technical Memorandum Nos Nccos. 306. doi:10.25923/7kgv-ba52.
  9. Tung, Liam. "Free AI developer app: IBM's new tool can label objects in videos for you". ZDNet.
  10. Bornstein, Aaron (Ari) (February 4, 2019). "Using Object Detection for Complex Image Classification Scenarios Part 4". Medium.
  11. Solawetz, Jacob (July 27, 2020). "Getting Started with VoTT Annotation Tool for Computer Vision". Roboflow Blog.
  12. "Best Open Source Annotation Tools for Computer Vision". www.sicara.ai.
  13. "Beyond Sentiment Analysis: Object Detection with ML.NET". September 20, 2020.
  14. "GitHub - microsoft/VoTT: Visual Object Tagging Tool: An electron app for building end to end Object Detection Models from Images and Videos". November 15, 2020 via GitHub.