Aleph Alpha

Aleph Alpha GmbH
Company type Private
Industry Artificial intelligence
Founded 2019
Founders
  • Jonas Andrulis
  • Samuel Weinbach
Headquarters Heidelberg, Germany
Products Luminous LLM
Number of employees
51-200 [1]  (2024)
Website aleph-alpha.com

Aleph Alpha GmbH is a German artificial intelligence (AI) startup founded by Jonas Andrulis and Samuel Weinbach, whose professional backgrounds include Apple and Deloitte. [2] Based in Heidelberg, the company aims to develop a sovereign technology stack for generative AI that operates independently of U.S. companies and complies with European regulations on data protection and AI, including the Artificial Intelligence Act. Aleph Alpha has reportedly established one of the most powerful AI clusters in its own data center, [3] and specializes in developing large language models (LLMs). These models are designed to provide transparency about the sources used to generate results and are intended for use by enterprises and governmental agencies. [4] The models have been trained in five European languages. [5]

CEO Jonas Andrulis

History

Aleph Alpha was founded in 2019 by Jonas Andrulis and Samuel Weinbach. Andrulis holds a degree in economics engineering from the Karlsruhe Institute of Technology, where his thesis focused on artificial intelligence. His professional experience includes consulting at Deloitte and founding several AI software companies. Prior to founding Aleph Alpha, he served as an AI R&D engineering manager at Apple's Special Projects Group and worked on classified research with Siri AI R&D. [6] Weinbach, who holds a degree in business administration, worked at Deloitte from 2010, where he was involved in establishing the Deloitte Analytic Institute, which focused on advancing corporate AI initiatives. [2]

Funding

Products

Luminous

Aleph Alpha developed its own AI language model, named Luminous, based on its own research and codebase. The model follows the generative pre-trained transformer (GPT) architecture and is trained with self-supervised learning. In addition to the standard functionality that all GPT models share, Aleph Alpha contributed several proprietary innovations.

Aleph Alpha uses the HPE Machine Learning Development System to build and train its foundation models. [18] The GPT-type design allows the foundation model to be adapted and fine-tuned for various applications. [19]

Luminous is used in Lumi, the citizen information system of the city of Heidelberg. [20]

Partnerships

See also

References

  1. "Aleph Alpha News, Hiring, Layoffs, Competitors, CEO, Fundraising Insights". RivalSense. Retrieved 28 October 2024.
  2. Chatbot-Konkurrenz aus Deutschland: Auf Wiedersehen, ChatGPT - und willkommen Aleph Alpha. finanzen.net, 2023-03-15 (in German). Retrieved 2023-11-10.
  3. Schreiner, Maximilian (2022-09-16). "OpenAI competitor Aleph Alpha launches Europe's fastest commercial AI data center". THE DECODER. Retrieved 2024-04-14.
  4. Maximilian Schreiner: AI in practice: AI startup Aleph Alpha shows off latest LLMs with a unique feature. The Decoder, 2023-06-05. Retrieved 2023-11-11
  5. Accelerating Europe’s multilingual AI revolution. Hewlett Packard Enterprise, 2022. Retrieved 2023-11-11.
  6. Eulerpool Research Systems. "L'histoire de réussite de Jonas Andrulis sous les projecteurs". Eulerpool Research Systems (in French). Retrieved 2024-04-14.
  7. Tucker, Charlotte (2021-01-27). "Heidelberg-based Aleph Alpha raises €5.3 million to lead "Made in Europe" AI development". EU-Startups. Retrieved 2024-04-14.
  8. Earlybird leads Aleph Alpha's 23 million EURO Serie A for the largest European AI models. press release, earlybird.com, 2021-07-27. Retrieved 2023-11-11.
  9. Generative AI Investments Aleph Alpha, Anthropic and Cohere. SAP news, 2023-07-18. Retrieved 2023-11-11.
  10. Aleph Alpha raises a total investment of more than half a billion US Dollars from a consortium of industry leaders and new investors. Press Release, aleph-alpha.com, 2023-11-06. Retrieved 2023-11-11.
  11. Aggi Cantrill and Mark Bergen: German Giants Pour Over $500 Million Into AI Startup Aleph Alpha. Bloomberg News, 2023-11-06. Retrieved 2023-11-11.
  12. heise online (2021-11-18). "KI-Modell kann Bilder beschreiben: Aleph Alpha ist Vorreiter für multimodale KI". Developer (in German). Retrieved 2024-04-14.
  13. heise online (2022-03-16). "GPT-3 überflügeln: Quellcode des KI-Modells MAGMA steht auf GitHub". Developer (in German). Retrieved 2024-04-14.
  14. heise online (2022-12-09). "KI-Bildsynthese: M-VADER erstellt Bilder aus beliebigen Text- und Bildvorgaben". Developer (in German). Retrieved 2024-04-14.
  15. "Aleph Alpha Forschungen: NeurIPS Highlights – ainfach.ai" (in German). 2023-12-08. Retrieved 2024-04-14.
  16. Schreiner, Maximilian (2023-06-05). "AI startup Aleph Alpha shows off latest LLMs with a unique feature". THE DECODER. Retrieved 2024-04-14.
  17. "Handelsblatt". www.handelsblatt.com. Retrieved 2024-04-14.
  18. Hewlett Packard Enterprise accelerates AI journey from POC to production with new solution for AI development and training at scale. Press Release, HPE, 2022-04-27. Retrieved 2023-11-11.
  19. Next-level customizability. aleph-alpha.com. Retrieved 2023-11-11.
  20. KI-Bürgerassistenz Lumi. heidelberg.de, 2023 (in German). Retrieved 2023-11-11.
  21. Matthias Hohensee: Wir verbünden uns mit den besten Unternehmen der Welt. In: WirtschaftsWoche, 2023-06-22 (in German). Retrieved 2023-11-11.