Medical open network for AI

Last updated
MONAI
Developer(s) Nvidia, National Institutes of Health, King's College London
Initial releaseVersion 0.2.0 (November 23, 2021)
Stable release
Version 1.2.0 (April 30, 2023)
Repository github.com/Project-MONAI/MONAI
Written in Python
Platform Cross-platform
Available inEnglish
Type Health software
License Apache License
Website monai.io

Medical open network for AI (MONAI) is an open-source, community-supported framework for Deep learning (DL) in healthcare imaging. MONAI provides a collection of domain-optimized implementations of various DL algorithms and utilities specifically designed for medical imaging tasks. MONAI is used in research and industry, aiding the development of various medical imaging applications, including image segmentation, image classification, image registration, and image generation. [1]

Contents

MONAI was first introduced in 2019 by a collaborative effort of engineers from Nvidia, the National Institutes of Health, and the King's College London academic community. The framework was developed to address the specific challenges and requirements of DL applied to medical imaging. [1]

Built on top of PyTorch, a popular DL library, MONAI offers a high-level interface for performing everyday medical imaging tasks, including image preprocessing, augmentation, DL model training, evaluation, and inference for diverse medical imaging applications. MONAI simplifies the development of DL models for medical image analysis by providing a range of pre-built components and modules. [1] [2] [3]

Medical image analysis foundations

Medical imaging strategies. (a) CT scan of the head. (b) MRI machine. (c) PET scans produce images of active blood flow and physiological activity in the targeted organ or organs. (d) Ultrasound technology to monitor pregnancy. 113abcd Medical Imaging Techniques.jpg
Medical imaging strategies. (a) CT scan of the head. (b) MRI machine. (c) PET scans produce images of active blood flow and physiological activity in the targeted organ or organs. (d) Ultrasound technology to monitor pregnancy.

Medical imaging is a range of imaging techniques and technologies that enables clinicians to visualize the internal structures of the human body. It aids in diagnosing, treating, and monitoring various medical conditions, thus allowing healthcare professionals to obtain detailed and non-invasive images of organs, tissues, and physiological processes. [4]

Medical imaging has evolved, driven by technological advancements and scientific understanding. Today, it encompasses modalities such as X-ray, Computed Tomography (CT), Magnetic Resonance Imaging (MRI), ultrasound, nuclear medicine, and digital pathology, each offering capabilities and insights into human anatomy and pathology. [4]

The images produced by these medical imaging modalities are interpreted by radiologists, trained specialists in analyzing and diagnosing medical conditions based on the visual information captured in the images. In recent years, the field has witnessed advancements in computer-aided diagnosis, integrating Artificial intelligence and Deep learning techniques to automatize medical image analysis and assist radiologists in detecting abnormalities and improving diagnostic accuracy. [5]

Features

MONAI provides a robust suite of libraries, tools, and Software Development Kits (SDKs) that encompass the entire process of building medical imaging applications. It offers a comprehensive range of resources to support every stage of developing Artificial intelligence (AI) solutions in the field of medical imaging, from initial annotation (MONAI Label), [2] through models development and evaluation (MONAI Core), [1] and final application deployment (MONAI deploy application SDK). [3]

Medical data labeling

AI-assisted annotation. MONAI Label utilizes AI algorithms to aid researchers and practitioners in medical imaging by providing annotation suggestions based on user interactions. Deepgrow.png
AI-assisted annotation. MONAI Label utilizes AI algorithms to aid researchers and practitioners in medical imaging by providing annotation suggestions based on user interactions.

MONAI Label is a versatile tool that enhances the image labeling and learning process by incorporating AI assistance. It simplifies the task of annotating new datasets by leveraging AI algorithms and user interactions. Through this collaboration, MONAI Label trains an AI model for a specific task and continually improves its performance as it receives additional annotated images. The tool offers a range of features and integrations that streamline the annotation workflow and ensure seamless integration with existing medical imaging platforms. [6]

Deep learning model development and evaluation

MONAI Core image segmentation example. Pipeline from training data retrieval through model implementation, training, and optimization to model inference. Auto3dseg.png
MONAI Core image segmentation example. Pipeline from training data retrieval through model implementation, training, and optimization to model inference.

Within MONAI Core, researchers can find a collection of tools and functionalities for dataset processing, loading, Deep learning (DL) model implementation, and evaluation. These utilities allow researchers to evaluate the performance of their models. MONAI Core offers customizable training pipelines, enabling users to construct and train models that support various learning approaches such as supervised, semi-supervised, and self-supervised learning. Additionally, users have the flexibility to implement different computing strategies to optimize the training process. [1]

AI-inference application development kit

MONAI Stream SDK application to endoscopy video AJA source MONAIStremSDK.svg
MONAI Stream SDK application to endoscopy video AJA source

The MONAI deploy application SDK offers a systematic series of steps empowering users to develop and fine-tune their AI models and workflows for deployment in clinical settings. These steps act as checkpoints, guaranteeing that the AI inference infrastructure adheres to the essential standards and requirements for seamless clinical integration. [3]

Key components of the MONAI Deploy Application SDK include:

Applications

MONAI has found applications in various research studies and industry implementations across different anatomical regions. For instance, it has been utilized in academic research involving automatic cranio-facial implant design, [27] brain tumor analysis from Magnetic Resonance images, [28] identification of features in focal liver lesions from MRI scans, [29] radiotherapy planning for prostate cancer, [30] preparation of datasets for fluorescence microscopy imaging, [31] and classification of pulmonary nodules in lung cancer. [32]

In healthcare settings, hospitals have leveraged MONAI to enhance mammography reading by employing Deep learning models for breast density analysis. This approach reduce the waiting time for patients, allowing them to receive mammography results within 15 minutes. Consequently, clinicians save time, and patients experience shorter wait times. This advancement enables patients to engage in immediate discussions with their clinicians during the same appointment, facilitating prompt decision-making and discussion of next steps before leaving the facility. Moreover, hospitals can employ MONAI to identify indications of a COVID-19 patient's deteriorating condition or determine if they can be safely discharged, optimizing patient care and post-COVID-19 decision-making. [33]

In the corporate realm, companies choose MONAI to develop product applications addressing various clinical challenges. These include ultrasound-based scoliosis assessment, Artificial intelligence-based pathology image labeling, in-field pneumothorax detection using ultrasound, characterization of brain morphology, detection of micro-fractures in teeth, and non-invasive estimation of intracranial pressure. [34]

See also

Related Research Articles

<span class="mw-page-title-main">CUDA</span> Parallel computing platform and programming model

Compute Unified Device Architecture (CUDA) is a parallel computing platform and application programming interface (API) that allows software to use certain types of graphics processing units (GPUs) for accelerated general-purpose processing, an approach called general-purpose computing on GPUs (GPGPU). CUDA API and its runtime: The CUDA API is an extension of the C programming language that adds the ability to specify thread-level parallelism in C and also to specify GPU device specific operations (like moving data between the CPU and the GPU). CUDA is a software layer that gives direct access to the GPU's virtual instruction set and parallel computational elements for the execution of compute kernels. In addition to drivers and runtime kernels, the CUDA platform includes compilers, libraries and developer tools to help programmers accelerate their applications.

Nuxeo is a software company making an open source content management system.

Microsoft Azure, often referred to as Azure, is a cloud computing platform developed by Microsoft. It offers access, management, and the development of applications and services through global data centers. It also provides a range of capabilities, including software as a service (SaaS), platform as a service (PaaS), and infrastructure as a service (IaaS). Microsoft Azure supports many programming languages, tools, and frameworks, including Microsoft-specific and third-party software and systems.

HOOPS Visualize is a 3D computer graphics software designed to render graphics across both mobile and desktop platforms. HOOPS Visualize provides 3D Graphics API to render CAD models. It's part of the HOOPS 3D Application Framework SDK. Since June 2018 it's licensed via Siemens PLM Software.

Windows UI Library is a user interface API that is part of the Windows Runtime programming model that forms the backbone of Universal Windows Platform apps for the Windows 8, Windows 8.1, Windows 10 and Windows Phone 8.1 operating systems. It enables declaring user interfaces using Extensible Application Markup Language (XAML) technology.

Backend as a service (BaaS), sometimes also referred to as mobile backend as a service (MBaaS), is a service for providing web app and mobile app developers with a way to easily build a backend to their frontend applications. Features available include user management, push notifications, and integration with social networking services. These services are provided via the use of custom software development kits (SDKs) and application programming interfaces (APIs). BaaS is a relatively recent development in cloud computing, with most BaaS startups dating from 2011 or later. Some of the most popular service providers are AWS Amplify and Firebase.

<span class="mw-page-title-main">Plotly</span> Canadian computing company

Plotly is a technical computing company headquartered in Montreal, Quebec, that develops online data analytics and visualization tools. Plotly provides online graphing, analytics, and statistics tools for individuals and collaboration, as well as scientific graphing libraries for Python, R, MATLAB, Perl, Julia, Arduino, JavaScript and REST.

Eclipse Deeplearning4j is a programming library written in Java for the Java virtual machine (JVM). It is a framework with wide support for deep learning algorithms. Deeplearning4j includes implementations of the restricted Boltzmann machine, deep belief net, deep autoencoder, stacked denoising autoencoder and recursive neural tensor network, word2vec, doc2vec, and GloVe. These algorithms all include distributed parallel versions that integrate with Apache Hadoop and Spark.

<span class="mw-page-title-main">Ionic (mobile app framework)</span> Open-source framework to develop hybrid mobile apps

Ionic is an open-source UI toolkit for building cross-platform mobile, web, and desktop applications using web technologies such as HTML, CSS, and JavaScript/TypeScript. It provides a set of pre-designed UI components and tools for building high-quality, interactive applications. Ionic was originally built as a complete open-source SDK for hybrid mobile app development created by Max Lynch, Ben Sperry, and Adam Bradley of Drifty Co. in 2013. The original version was released in 2013 and built on top of AngularJS and Apache Cordova. However, the latest release was re-built as a set of Web Components using StencilJS, allowing the user to choose any user interface framework, such as Angular, React or Vue.js. It also allows the use of Ionic components with no user interface framework at all. Ionic provides tools and services for developing hybrid mobile, desktop, and progressive web apps based on modern web development technologies and practices, using Web technologies like CSS, HTML5, and Sass. In particular, mobile apps can be built with these Web technologies and then distributed through native app stores to be installed on devices by utilizing Cordova or Capacitor.

<span class="mw-page-title-main">KDE Gear</span> Set of applications and supporting libraries

The KDE Gear is a set of applications and supporting libraries that are developed by the KDE community, primarily used on Linux-based operating systems but mostly multiplatform, and released on a common release schedule.

<span class="mw-page-title-main">TensorFlow</span> Machine learning software library

TensorFlow is a free and open-source software library for machine learning and artificial intelligence. It can be used across a range of tasks but has a particular focus on training and inference of deep neural networks.

Nvidia Jetson is a series of embedded computing boards from Nvidia. The Jetson TK1, TX1 and TX2 models all carry a Tegra processor from Nvidia that integrates an ARM architecture central processing unit (CPU). Jetson is a low-power system and is designed for accelerating machine learning applications.

Clarifai is an independent artificial intelligence company that specializes in computer vision, natural language processing, and audio recognition. One of the first deep learning platforms having been founded in 2013, Clarifai provides an AI platform for unstructured image, video, text, and audio data. Its platform supports the full AI lifecycle for data exploration, data labeling, model training, evaluation and inference around images, video, text, and audio data. Headquartered in Washington DC and with employees in the US, Canada, Argentina, Estonia and India Clarifai uses machine learning and deep neural networks to identify and analyze images, videos, text and audio automatically. Clarifai enables users to implement AI technology into their products.

<span class="mw-page-title-main">Dask (software)</span> Python library for parallel computing

Dask is an open-source Python library for parallel computing. Dask scales Python code from multi-core local machines to large distributed clusters in the cloud. Dask provides a familiar user interface by mirroring the APIs of other libraries in the PyData ecosystem including: Pandas, scikit-learn and NumPy. It also exposes low-level APIs that help programmers run custom algorithms in parallel.

Amazon SageMaker is a cloud based machine-learning platform that allows the creation, training, and deployment by developers of machine-learning (ML) models on the cloud. It can be used to deploy ML models on embedded systems and edge-devices. SageMaker was launched in November 2017.

OpenVINO is an open-source software toolkit for optimizing and deploying deep learning models. It enables programmers to develop scalable and efficient AI solutions with relatively few lines of code. It supports several popular model formats and categories, such as large language models, computer vision, and generative AI.

<span class="mw-page-title-main">Windows App SDK</span> Software development kit from Microsoft

Windows App SDK is a software development kit (SDK) from Microsoft that provides a unified set of APIs and components that can be used to develop desktop applications for both Windows 11 and Windows 10 version 1809 and later. The purpose of this project is to offer a decoupled implementation of capabilities which were previously tightly-coupled to the UWP app model. Windows App SDK allows native Win32 (USER32/GDI32) or .NET (WPF/WinForms) developers alike a path forward to enhance their apps with modern features.

Hugging Face, Inc. is a French-American company based in New York City that develops computation tools for building applications using machine learning. It is most notable for its transformers library built for natural language processing applications and its platform that allows users to share machine learning models and datasets and showcase their work.

References

  1. 1 2 3 4 5 6 7 8 9 Cardoso, M. Jorge; Li, Wenqi; Brown, Richard (2022-10-04). "MONAI: An open-source framework for deep learning in healthcare". pp. 1–25. arXiv: 2211.02701 [cs.LG].
  2. 1 2 Diaz-Pinto, Andres; Alle, Sachidanand; Nath, Vishwesh (2023-04-23). "MONAI Label: A framework for AI-assisted Interactive Labeling of 3D Medical Images". pp. 1–20. arXiv: 2203.12362 [cs.HC].
  3. 1 2 3 4 5 6 MONAI Deploy App SDK, Project MONAI, 2023-06-29, retrieved 2023-07-06
  4. 1 2 Dhawan, Atam P. (2011-01-24). Medical Image Analysis (1 ed.). Wiley. pp. 23–368. doi:10.1002/9780470918548. ISBN   978-0-470-62205-6.
  5. Doi, Kunio (2007). "Computer-aided diagnosis in medical imaging: Historical review, current status and future potential". Computerized Medical Imaging and Graphics. 31 (4–5): 198–211. doi:10.1016/j.compmedimag.2007.02.002. PMC   1955762 . PMID   17349778.
  6. 1 2 3 4 5 MONAI Label, Project MONAI, 2023-07-06, retrieved 2023-07-06
  7. "OHIF Viewer". viewer.ohif.org. Retrieved 2023-07-06.
  8. "QuPath". qupath.github.io. Retrieved 2023-07-06.
  9. "Harness the full potential of your digital pathology data". Digital Slide Archive. Retrieved 2023-07-06.
  10. "Descending into ML: Training and Loss | Machine Learning". Google for Developers. Retrieved 2023-07-06.
  11. An, Li-Bao (2011). "Cutting parameter optimization in milling operations by various solution methods". 2011 International Conference on Machine Learning and Cybernetics. IEEE. pp. 422–427. doi:10.1109/icmlc.2011.6016679. ISBN   978-1-4577-0305-8. S2CID   12484876.
  12. "Loss functions — MONAI 1.2.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  13. "Optimizers — MONAI 1.2.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  14. 1 2 "Metrics — MONAI 1.2.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  15. "DLProf User Guide" (PDF). docs.nvidia.com. Retrieved 2023-07-06.
  16. "Nsight Systems". NVIDIA Developer. 2018-03-12. Retrieved 2023-07-06.
  17. "NVIDIA Tools Extension (NVTX) Documentation". docs.nvidia.com. Retrieved 2023-07-06.
  18. "NVML API Reference Guide :: GPU Deployment and Management Documentation". docs.nvidia.com. Retrieved 2023-07-06.
  19. "PyTorch-Ignite". PyTorch-Ignite. Retrieved 2023-07-06.
  20. "XLA: Optimizing Compiler for Machine Learning". TensorFlow. Retrieved 2023-07-06.
  21. "Slurm Workload Manager - Documentation". slurm.schedmd.com. Retrieved 2023-07-06.
  22. "MONAI Model Zoo". monai.io. Retrieved 2023-07-06.
  23. "Model Bundle — MONAI 1.2.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  24. "Deploying and Hosting MONAI App Package — MONAI Deploy App SDK 0.4.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  25. "Packaging app — MONAI Deploy App SDK 0.4.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  26. "Executing packaged app locally — MONAI Deploy App SDK 0.4.0 Documentation". docs.monai.io. Retrieved 2023-07-06.
  27. Li, Jianning; Ferreira, André; Puladi, Behrus; Alves, Victor; Kamp, Michael; Kim, Moon; Nensa, Felix; Kleesiek, Jens; Ahmadi, Seyed-Ahmad; Egger, Jan (2023). "Open-source skull reconstruction with MONAI". SoftwareX. 23: 101432. arXiv: 2211.14051 . Bibcode:2023SoftX..2301432L. doi:10.1016/j.softx.2023.101432. ISSN   2352-7110.
  28. Sharma, Suraj Prakash; Sampath, Nalini (2022-06-24). "Data Augmentation for Brain Tumor Segmentation using MONAI Framework". 2022 2nd International Conference on Intelligent Technologies (CONIT). IEEE. pp. 1–8. doi:10.1109/conit55038.2022.9847822. ISBN   978-1-6654-8407-7. S2CID   251761810.
  29. Stollmayer, Róbert; Budai, Bettina Katalin; Rónaszéki, Aladár; Zsombor, Zita; Kalina, Ildikó; Hartmann, Erika; Tóth, Gábor; Szoldán, Péter; Bérczi, Viktor; Maurovich-Horvat, Pál; Kaposi, Pál Novák (2022-05-05). "Focal Liver Lesion MRI Feature Identification Using Efficientnet and MONAI: A Feasibility Study". Cells. 11 (9): 1558. doi: 10.3390/cells11091558 . ISSN   2073-4409. PMC   9104155 . PMID   35563862.
  30. Belue, Mason J.; Harmon, Stephanie A.; Patel, Krishnan; Daryanani, Asha; Yilmaz, Enis Cagatay; Pinto, Peter A.; Wood, Bradford J.; Citrin, Deborah E.; Choyke, Peter L.; Turkbey, Baris (2022). "Development of a 3D CNN-based AI Model for Automated Segmentation of the Prostatic Urethra". Academic Radiology. 29 (9): 1404–1412. doi:10.1016/j.acra.2022.01.009. ISSN   1076-6332. PMC   9339453 . PMID   35183438.
  31. Poon, Charissa; Teikari, Petteri; Rachmadi, Muhammad Febrian; Skibbe, Henrik; Hynynen, Kullervo (2022-07-20). "MiniVess: A dataset of rodent cerebrovasculature from in vivo multiphoton fluorescence microscopy imaging": 1–11. doi:10.1101/2022.07.19.500542.{{cite journal}}: Cite journal requires |journal= (help)
  32. Kaliyugarasan, Satheshkumar; Lundervold, Arvid; Lundervold, Alexander Selvikvåg (2021). "Pulmonary Nodule Classification in Lung Cancer from 3D Thoracic CT Scans Using fastai and MONAI". International Journal of Interactive Multimedia and Artificial Intelligence. 6 (7): 83. doi: 10.9781/ijimai.2021.05.002 . hdl: 11250/2772280 . ISSN   1989-1660. S2CID   236693622.
  33. Durlach, Peter (2023-05-11). "Microsoft and our partners: Bringing AI models to clinical settings". Microsoft Industry Blogs. Retrieved 2023-07-06.
  34. Dennison, Jaisil Rose (2022-08-11). "Leveraging MONAI for Medical AI". www.kitware.com. Retrieved 2023-07-06.

Further reading