Medical open network for AI

Last updated
MONAI
Developer(s) Nvidia, National Institutes of Health, King's College London
Initial releaseVersion 0.2.0 (November 23, 2021)
Stable release
Version 1.2.0 (April 30, 2023)
Repository github.com/Project-MONAI/MONAI
Written in Python
Platform Cross-platform
Available inEnglish
Type Health software
License Apache License
Website monai.io

Medical open network for AI (MONAI) is an open-source, community-supported framework for Deep learning (DL) in healthcare imaging. MONAI provides a collection of domain-optimized implementations of various DL algorithms and utilities specifically designed for medical imaging tasks. MONAI is used in research and industry, aiding the development of various medical imaging applications, including image segmentation, image classification, image registration, and image generation. [1]

Contents

MONAI was first introduced in 2019 by a collaborative effort of engineers from Nvidia, the National Institutes of Health, and the King's College London academic community. The framework was developed to address the specific challenges and requirements of DL applied to medical imaging. [1]

Built on top of PyTorch, a popular DL library, MONAI offers a high-level interface for performing everyday medical imaging tasks, including image preprocessing, augmentation, DL model training, evaluation, and inference for diverse medical imaging applications. MONAI simplifies the development of DL models for medical image analysis by providing a range of pre-built components and modules. [1] [2] [3]

MONAI is part of a larger suite of Artificial Intelligence (AI)-powered software called NVIDIA Clara. [4] Besides MONAI, Clara also comprises NVIDIA Parabricks for genome analysis. [5]

Medical image analysis foundations

Medical imaging strategies. (a) CT scan of the head. (b) MRI machine. (c) PET scans produce images of active blood flow and physiological activity in the targeted organ or organs. (d) Ultrasound technology to monitor pregnancy. 113abcd Medical Imaging Techniques.jpg
Medical imaging strategies. (a) CT scan of the head. (b) MRI machine. (c) PET scans produce images of active blood flow and physiological activity in the targeted organ or organs. (d) Ultrasound technology to monitor pregnancy.

Medical imaging is a range of imaging techniques and technologies that enables clinicians to visualize the internal structures of the human body. It aids in diagnosing, treating, and monitoring various medical conditions, thus allowing healthcare professionals to obtain detailed and non-invasive images of organs, tissues, and physiological processes. [6]

Medical imaging has evolved, driven by technological advancements and scientific understanding. Today, it encompasses modalities such as X-ray, Computed Tomography (CT), Magnetic Resonance Imaging (MRI), ultrasound, nuclear medicine, and digital pathology, each offering capabilities and insights into human anatomy and pathology. [6]

The images produced by these medical imaging modalities are interpreted by radiologists, trained specialists in analyzing and diagnosing medical conditions based on the visual information captured in the images. In recent years, the field has witnessed advancements in computer-aided diagnosis, integrating Artificial intelligence and Deep learning techniques to automatize medical image analysis and assist radiologists in detecting abnormalities and improving diagnostic accuracy. [7]

Features

MONAI provides a robust suite of libraries, tools, and Software Development Kits (SDKs) that encompass the entire process of building medical imaging applications. It offers a comprehensive range of resources to support every stage of developing Artificial intelligence (AI) solutions in the field of medical imaging, from initial annotation (MONAI Label), [2] through models development and evaluation (MONAI Core), [1] and final application deployment (MONAI deploy application SDK). [3]

Medical data labeling

AI-assisted annotation. MONAI Label utilizes AI algorithms to aid researchers and practitioners in medical imaging by providing annotation suggestions based on user interactions. Deepgrow.png
AI-assisted annotation. MONAI Label utilizes AI algorithms to aid researchers and practitioners in medical imaging by providing annotation suggestions based on user interactions.

MONAI Label is a versatile tool that enhances the image labeling and learning process by incorporating AI assistance. It simplifies the task of annotating new datasets by leveraging AI algorithms and user interactions. Through this collaboration, MONAI Label trains an AI model for a specific task and continually improves its performance as it receives additional annotated images. The tool offers a range of features and integrations that streamline the annotation workflow and ensure seamless integration with existing medical imaging platforms. [8]

Deep learning model development and evaluation

MONAI Core image segmentation example. Pipeline from training data retrieval through model implementation, training, and optimization to model inference. Auto3dseg.png
MONAI Core image segmentation example. Pipeline from training data retrieval through model implementation, training, and optimization to model inference.

Within MONAI Core, researchers can find a collection of tools and functionalities for dataset processing, loading, Deep learning (DL) model implementation, and evaluation. These utilities allow researchers to evaluate the performance of their models. MONAI Core offers customizable training pipelines, enabling users to construct and train models that support various learning approaches such as supervised, semi-supervised, and self-supervised learning. Additionally, users have the flexibility to implement different computing strategies to optimize the training process. [1]

AI-inference application development kit

MONAI Stream SDK application to endoscopy video AJA source MONAIStremSDK.svg
MONAI Stream SDK application to endoscopy video AJA source

The MONAI deploy application SDK offers a systematic series of steps empowering users to develop and fine-tune their AI models and workflows for deployment in clinical settings. These steps act as checkpoints, guaranteeing that the AI inference infrastructure adheres to the essential standards and requirements for seamless clinical integration. [3]

Key components of the MONAI Deploy Application SDK include:

Applications

MONAI has found applications in various research studies and industry implementations across different anatomical regions. For instance, it has been utilized in academic research involving automatic cranio-facial implant design, [29] brain tumor analysis from Magnetic Resonance images, [30] identification of features in focal liver lesions from MRI scans, [31] radiotherapy planning for prostate cancer, [32] preparation of datasets for fluorescence microscopy imaging, [33] and classification of pulmonary nodules in lung cancer. [34]

In healthcare settings, hospitals have leveraged MONAI to enhance mammography reading by employing Deep learning models for breast density analysis. This approach reduce the waiting time for patients, allowing them to receive mammography results within 15 minutes. Consequently, clinicians save time, and patients experience shorter wait times. This advancement enables patients to engage in immediate discussions with their clinicians during the same appointment, facilitating prompt decision-making and discussion of next steps before leaving the facility. Moreover, hospitals can employ MONAI to identify indications of a COVID-19 patient's deteriorating condition or determine if they can be safely discharged, optimizing patient care and post-COVID-19 decision-making. [35]

In the corporate realm, companies choose MONAI to develop product applications addressing various clinical challenges. These include ultrasound-based scoliosis assessment, Artificial intelligence-based pathology image labeling, in-field pneumothorax detection using ultrasound, characterization of brain morphology, detection of micro-fractures in teeth, and non-invasive estimation of intracranial pressure. [36]

See also

Related Research Articles

Nuxeo is a software company making an open source content management system.

<span class="mw-page-title-main">Microsoft Azure</span> Cloud computing platform by Microsoft

Microsoft Azure, or just Azure, is the cloud computing platform developed by Microsoft. It offers management, access and development of applications and services to individuals, companies, and governments through its global infrastructure. It also provides a range of capabilities, including software as a service (SaaS), platform as a service (PaaS), and infrastructure as a service (IaaS). Microsoft Azure supports many programming languages, tools, and frameworks, including Microsoft-specific and third-party software and systems.

HOOPS Visualize is a 3D computer graphics software designed to render graphics across both mobile and desktop platforms. HOOPS Visualize provides 3D Graphics API to render CAD models. It's part of the HOOPS 3D Application Framework SDK. Since June 2018 it's licensed via Siemens PLM Software.

Dart is a programming language designed by Lars Bak and Kasper Lund and developed by Google. It can be used to develop web and mobile apps as well as server and desktop applications.

Backend as a service (BaaS), sometimes also referred to as mobile backend as a service (MBaaS), is a service for providing web app and mobile app developers with a way to easily build a backend to their frontend applications. Features available include user management, push notifications, and integration with social networking services. These services are provided via the use of custom software development kits (SDKs) and application programming interfaces (APIs). BaaS is a relatively recent development in cloud computing, with most BaaS startups dating from 2011 or later. Some of the most popular service providers are AWS Amplify and Firebase.

<span class="mw-page-title-main">Plotly</span> Canadian computing company

Plotly is a technical computing company headquartered in Montreal, Quebec, that develops online data analytics and visualization tools. Plotly provides online graphing, analytics, and statistics tools for individuals and collaboration, as well as scientific graphing libraries for Python, R, MATLAB, Perl, Julia, Arduino, JavaScript and REST.

Eclipse Deeplearning4j is a programming library written in Java for the Java virtual machine (JVM). It is a framework with wide support for deep learning algorithms. Deeplearning4j includes implementations of the restricted Boltzmann machine, deep belief net, deep autoencoder, stacked denoising autoencoder and recursive neural tensor network, word2vec, doc2vec, and GloVe. These algorithms all include distributed parallel versions that integrate with Apache Hadoop and Spark.

<span class="mw-page-title-main">Ionic (mobile app framework)</span> Open-source framework to develop hybrid mobile apps

Ionic is an open-source UI toolkit for building cross-platform mobile, web, and desktop applications using web technologies such as HTML, CSS, and JavaScript/TypeScript. It provides a set of pre-designed UI components and tools for building high-quality, interactive applications. Ionic was originally built as a complete open-source SDK for hybrid mobile app development created by Max Lynch, Ben Sperry, and Adam Bradley of Drifty Co. in 2013. The original version was released in 2013 and built on top of AngularJS and Apache Cordova. However, the latest release was re-built as a set of Web Components using StencilJS, allowing the user to choose any user interface framework, such as Angular, React or Vue.js. It also allows the use of Ionic components with no user interface framework at all. Ionic provides tools and services for developing hybrid mobile, desktop, and progressive web apps based on modern web development technologies and practices, using Web technologies like CSS, HTML5, and Sass. In particular, mobile apps can be built with these Web technologies and then distributed through native app stores to be installed on devices by utilizing Cordova or Capacitor.

<span class="mw-page-title-main">KDE Gear</span> Set of applications and supporting libraries

The KDE Gear is a set of applications and supporting libraries that are developed by the KDE community, primarily used on Linux-based operating systems but mostly multiplatform, and released on a common release schedule.

MinIO is an object storage system released under GNU Affero General Public License v3.0. It is API compatible with the Amazon S3 cloud storage service. It is capable of working with unstructured data such as photos, videos, log files, backups, and container images with the maximum supported object size being 50TB.

Nvidia Jetson is a series of embedded computing boards from Nvidia. The Jetson TK1, TX1 and TX2 models all carry a Tegra processor from Nvidia that integrates an ARM architecture central processing unit (CPU). Jetson is a low-power system and is designed for accelerating machine learning applications.

<span class="mw-page-title-main">Dask (software)</span> Python library for parallel computing

Dask is an open-source Python library for parallel computing. Dask scales Python code from multi-core local machines to large distributed clusters in the cloud. Dask provides a familiar user interface by mirroring the APIs of other libraries in the PyData ecosystem including: Pandas, scikit-learn and NumPy. It also exposes low-level APIs that help programmers run custom algorithms in parallel.

Amazon SageMaker is a cloud-based machine-learning platform that allows the creation, training, and deployment by developers of machine-learning (ML) models on the cloud. It can be used to deploy ML models on embedded systems and edge-devices. The platform was launched in November 2017.

OpenVINO is an open-source software toolkit for optimizing and deploying deep learning models. It enables programmers to develop scalable and efficient AI solutions with relatively few lines of code. It supports several popular model formats and categories, such as large language models, computer vision, and generative AI.

FastAPI is a web framework first released in 2018 for building HTTP-based service APIs in Python. It is used for building APIs with Python 3.8+ based on standard Python-type hints.

<span class="mw-page-title-main">Windows App SDK</span> Software development kit from Microsoft

Windows App SDK is a software development kit (SDK) from Microsoft that provides a unified set of APIs and components that can be used to develop desktop applications for both Windows 11 and Windows 10 version 1809 and later. The purpose of this project is to offer a decoupled implementation of capabilities which were previously tightly-coupled to the UWP app model. Windows App SDK allows native Win32 (USER32/GDI32) or .NET (WPF/WinForms) developers alike a path forward to enhance their apps with modern features.

Hugging Face, Inc. is a French-American company incorporated under the Delaware General Corporation Law and based in New York City that develops computation tools for building applications using machine learning. It is most notable for its transformers library built for natural language processing applications and its platform that allows users to share machine learning models and datasets and showcase their work.

<span class="mw-page-title-main">Data Version Control (software)</span> Open source version system

DVC is a free and open-source, platform-agnostic version system for data, machine learning models, and experiments. It is designed to make ML models shareable, experiments reproducible, and to track versions of models, data, and pipelines. DVC works on top of Git repositories and cloud storage.

HarmonyOS NEXT is a proprietary distributed operating system and an iteration of HarmonyOS, developed by Huawei to support only HarmonyOS native apps. The operating system is primarily aimed at software and hardware developers that deal directly with Huawei. It does not include Android's AOSP core and is incompatible with Android applications.

References

  1. 1 2 3 4 5 6 7 8 9 Cardoso, M. Jorge; Li, Wenqi; Brown, Richard (2022-10-04). "MONAI: An open-source framework for deep learning in healthcare". pp. 1–25. arXiv: 2211.02701 [cs.LG].
  2. 1 2 Diaz-Pinto, Andres; Alle, Sachidanand; Nath, Vishwesh (2023-04-23). "MONAI Label: A framework for AI-assisted Interactive Labeling of 3D Medical Images". pp. 1–20. arXiv: 2203.12362 [cs.HC].
  3. 1 2 3 4 5 6 MONAI Deploy App SDK, Project MONAI, 2023-06-29, retrieved 2023-07-06
  4. "NVIDIA Clara". NVIDIA. Retrieved 2024-07-11.
  5. "NVIDIA Parabricks". NVIDIA. Retrieved 2024-07-11.
  6. 1 2 Dhawan, Atam P. (2011-01-24). Medical Image Analysis (1 ed.). Wiley. pp. 23–368. doi:10.1002/9780470918548. ISBN   978-0-470-62205-6.
  7. Doi, Kunio (2007). "Computer-aided diagnosis in medical imaging: Historical review, current status and future potential". Computerized Medical Imaging and Graphics. 31 (4–5): 198–211. doi:10.1016/j.compmedimag.2007.02.002. PMC   1955762 . PMID   17349778.
  8. 1 2 3 4 5 MONAI Label, Project MONAI, 2023-07-06, retrieved 2023-07-06
  9. "OHIF Viewer". viewer.ohif.org. Retrieved 2023-07-06.
  10. "QuPath". qupath.github.io. Retrieved 2023-07-06.
  11. "Harness the full potential of your digital pathology data". Digital Slide Archive. Retrieved 2023-07-06.
  12. "Descending into ML: Training and Loss | Machine Learning". Google for Developers. Retrieved 2023-07-06.
  13. An, Li-Bao (2011). "Cutting parameter optimization in milling operations by various solution methods". 2011 International Conference on Machine Learning and Cybernetics. IEEE. pp. 422–427. doi:10.1109/icmlc.2011.6016679. ISBN   978-1-4577-0305-8. S2CID   12484876.
  14. "Loss functions — MONAI 1.2.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  15. "Optimizers — MONAI 1.2.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  16. 1 2 "Metrics — MONAI 1.2.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  17. "DLProf User Guide" (PDF). docs.nvidia.com. Retrieved 2023-07-06.
  18. "Nsight Systems". NVIDIA Developer. 2018-03-12. Retrieved 2023-07-06.
  19. "NVIDIA Tools Extension (NVTX) Documentation". docs.nvidia.com. Retrieved 2023-07-06.
  20. "NVML API Reference Guide :: GPU Deployment and Management Documentation". docs.nvidia.com. Retrieved 2023-07-06.
  21. "PyTorch-Ignite". PyTorch-Ignite. Retrieved 2023-07-06.
  22. "XLA: Optimizing Compiler for Machine Learning". TensorFlow. Retrieved 2023-07-06.
  23. "Slurm Workload Manager - Documentation". slurm.schedmd.com. Retrieved 2023-07-06.
  24. "MONAI Model Zoo". monai.io. Retrieved 2023-07-06.
  25. "Model Bundle — MONAI 1.2.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  26. "Deploying and Hosting MONAI App Package — MONAI Deploy App SDK 0.4.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  27. "Packaging app — MONAI Deploy App SDK 0.4.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  28. "Executing packaged app locally — MONAI Deploy App SDK 0.4.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  29. Li, Jianning; Ferreira, André; Puladi, Behrus; Alves, Victor; Kamp, Michael; Kim, Moon; Nensa, Felix; Kleesiek, Jens; Ahmadi, Seyed-Ahmad; Egger, Jan (2023). "Open-source skull reconstruction with MONAI". SoftwareX. 23: 101432. arXiv: 2211.14051 . Bibcode:2023SoftX..2301432L. doi:10.1016/j.softx.2023.101432. ISSN   2352-7110.
  30. Sharma, Suraj Prakash; Sampath, Nalini (2022-06-24). "Data Augmentation for Brain Tumor Segmentation using MONAI Framework". 2022 2nd International Conference on Intelligent Technologies (CONIT). IEEE. pp. 1–8. doi:10.1109/conit55038.2022.9847822. ISBN   978-1-6654-8407-7. S2CID   251761810.
  31. Stollmayer, Róbert; Budai, Bettina Katalin; Rónaszéki, Aladár; Zsombor, Zita; Kalina, Ildikó; Hartmann, Erika; Tóth, Gábor; Szoldán, Péter; Bérczi, Viktor; Maurovich-Horvat, Pál; Kaposi, Pál Novák (2022-05-05). "Focal Liver Lesion MRI Feature Identification Using Efficientnet and MONAI: A Feasibility Study". Cells. 11 (9): 1558. doi: 10.3390/cells11091558 . ISSN   2073-4409. PMC   9104155 . PMID   35563862.
  32. Belue, Mason J.; Harmon, Stephanie A.; Patel, Krishnan; Daryanani, Asha; Yilmaz, Enis Cagatay; Pinto, Peter A.; Wood, Bradford J.; Citrin, Deborah E.; Choyke, Peter L.; Turkbey, Baris (2022). "Development of a 3D CNN-based AI Model for Automated Segmentation of the Prostatic Urethra". Academic Radiology. 29 (9): 1404–1412. doi:10.1016/j.acra.2022.01.009. ISSN   1076-6332. PMC   9339453 . PMID   35183438.
  33. Poon, Charissa; Teikari, Petteri; Rachmadi, Muhammad Febrian; Skibbe, Henrik; Hynynen, Kullervo (2022-07-20). "MiniVess: A dataset of rodent cerebrovasculature from in vivo multiphoton fluorescence microscopy imaging": 1–11. doi:10.1101/2022.07.19.500542.{{cite journal}}: Cite journal requires |journal= (help)
  34. Kaliyugarasan, Satheshkumar; Lundervold, Arvid; Lundervold, Alexander Selvikvåg (2021). "Pulmonary Nodule Classification in Lung Cancer from 3D Thoracic CT Scans Using fastai and MONAI". International Journal of Interactive Multimedia and Artificial Intelligence. 6 (7): 83. doi: 10.9781/ijimai.2021.05.002 . hdl: 11250/2772280 . ISSN   1989-1660. S2CID   236693622.
  35. Durlach, Peter (2023-05-11). "Microsoft and our partners: Bringing AI models to clinical settings". Microsoft Industry Blogs. Retrieved 2023-07-06.
  36. Dennison, Jaisil Rose (2022-08-11). "Leveraging MONAI for Medical AI". www.kitware.com. Retrieved 2023-07-06.

Further reading