Learn how Altilia is leveraging GPT and Large Language Models to enhance its IDP platform

Taking on the challenge of managing complex Machine Learning Operations

By altilia on February 1, 2023

As we explored in our previous blog, demand for AI-based solutions is growing year on year as organizations adopt and experiment with artificial intelligence to drive competitiveness and efficiency in a tough business environment.

We examined the key talent and skills gap challenge that companies, particularly small and medium enterprises, are facing as the explosion of AI projects outstrips the supply of qualified data experts required.

Now we want to go deep on another significant issue that has been identified by business leaders and consultants as a blocker to growth. And also show how Altilia can offset this challenge.

Managing Machine Learning (ML) Operations

When companies consider introducing new business process automation solutions, they often underestimate the complexity of managing the lifecycle of an AI project.

ML Ops are defined as a set of practices that combine the implementation of Machine Learning, DevOps, and Data Engineering models, with the goal of developing, releasing, monitoring, and the scaling into production high quality ML systems. This is done by leveraging organizational, cultural, and technological aspects that support the governance and automation of the ML model lifecycle.

Properly managing ML Ops is a major challenge even for large and structured companies.

In fact, according to a recent survey by the Artificial Intelligence Observatory of the Polytechnic University of Milan, out of a sample of 80 large companies operating in Italy that have already started AI projects, 71% say they have not yet introduced ML Ops as a structured practice.

Among the stated reasons for this delay stands out the lack of time they devote to AI initiatives compared to traditional business activities (43%) and the lack of internal expertise to manage ML Ops (33%), an issue which brings us back to the challenge discussed in our last blog.

Deloitte identified this ML Ops issue in their State of AI report: “Despite evidence that establishing clear processes and redefining roles to deliver quality AI results in improved outcomes, there has been little growth in the market in terms of adopting such practices, according to survey respondents.

In both the fourth and fifth editions, just one-third of respondents reported that their companies are always following MP Ops, redesigning workflows and documenting AI model life cycles.”

The ML Ops cycle in its complexity could be broken down in three distinctive parts:

  • The Business process analysis/understanding part refers to the mapping of internal processes, systems, and any other key element to understand the “problem” to be solved: such as defining requirements, objectives, key outcomes, and formulating use case hypotheses for ML applications.
  • The Data & Model Preparation part concerns the collection and refinement of data to train the ML model; the selection of the ML model with the best fit to solve the problem; the testing of different algorithms, features and hyperparameters; keeping track of all experiments, and maximizing code reusability.
  • Continuous Operations & Improvement refers to the continuous update of the solution with each product release; the continuous training of the algorithm, with the storage and processing of new data to update the model; the continuous monitoring of the model to keep track of changes in performance and accuracy.

How can Altilia help to overcome the ML Ops challenge?

Altilia’s approach is offering a comprehensive platform to manage the whole AI implementation lifecycle with an end-to-end approach.

The platform is built overtime and improved upon experience by Altilia, to hide the complexity of managing ML Ops, and to simplify the training and the optimization of AI models over time.

This eliminates the need to dedicate an internal team (with limited experience) just to maintain an AI implementation project, to monitor its performance over time and to manage its infrastructure.

Additionally, the automation challenges faced by businesses are rarely completely unique. Using a platform approach, Altilia can replicate pre-tested use case specific solutions and adapt them to perfectly fit the needs of the customer.

This greatly simplifies the process analysis, reducing the uncertainty when defining implementation goals, a leading to a greater confidence about the expected outcomes.

Altilia simplifies data and model preparation by giving access to library of generalized pre-built AI models, including Large Language Models (LLMs) that can be trained and adapted, according to the customer’s specific requirements.

The training of models is simplified, thanks to our document annotation interface, that allows users to easily generate examples for the system to process.

Lastly, Altilia facilitates the monitoring and fine-tuning of AI models to increase their accuracy over time and prevent data drift.

Contact Altilia here to learn how our unique AI technology platform can help your organization overcome these challenges.

By altilia on February 1, 2023

Explore more stories like this one

How the technology behind Chat GPT can work for your organization

The explosion of interest and publicity in Artificial Intelligence in recent months has come from the advent of Large Language Models, specifically OpenAI’s ChatGPT, which set the record for the fastest-growing user base in January. Suddenly it seems like everyone is fascinated by the coming surge of AI with new applications, creating excitement and fear for the future. When Google’s so-called “Godfather of AI” Dr Geoffrey Hinton warned about “quite scary” dangers, it made headlines around the world. Behind the hype So, it is important to understand what is behind the hype and see how it works and what your organization can use to build future value. This blog is split into two: first we learn about Natural Language Processing, the branch of computer science concerned with giving machines the ability to understand text and spoken words in much the same way humans can. And then we will go deeper on Large Language Models (LLMs), which is what ChatGPT and others like Google’s Bard are using. NLP combines computational linguistics with statistical, machine learning, and deep learning models to enable computers to process human language in the form of text or voice data and to ‘understand’ its full meaning, complete with the speaker or writer’s intent and sentiment. NLP drives computer programs that translate text from one language to another, respond to spoken commands, and summarize large volumes of text rapidly—even in real time. There’s a good chance you’ve interacted with NLP in the form of voice-operated GPS systems, digital assistants, speech-to-text dictation software, customer service chatbots, and other consumer conveniences. But NLP also plays a growing role in enterprise solutions that help streamline business operations, increase employee productivity, and simplify mission-critical business processes. There are two sub-fields of NLP: Natural Language Understanding (NLU) uses syntactic and semantic analysis of text and speech to determine the meaning of a sentence, similarly to how humans do it naturally. Altilia uses Large Language Models for this. Natural Language Generation (NLG) enables computers to write a human language text response based on data input. ChatGPT uses LLMs for NLG. Large Language Models (LLMs) LLMs are a relatively new approach where massive amounts of text are fed into the AI algorithm using unsupervised learning to create a “foundation” model, which can use transfer learning to continually learn new tasks. The key is using huge volumes of data. The training data for ChatGPT comes from a diverse set of text sources, including billions of web pages from the internet, a huge number of books from different genres, articles from news websites, magazines and academic journals and social media platforms such as Twitter, Reddit and Facebook to learn about informal language and the nuances of social interactions. The model is then able to predict the next word in a sentence and generate coherent text in a wide range of language tasks. Altilia does exactly the same, but uses this capability to provide enterprise tools for specific business use cases. Technology breakthrough Overall, NLP is the core technology to understand the content of documents. LLMs are a breakthrough in the field as they allow a shift from where an NLP model had to be trained in silos for a specific task to one where LLMs can leverage accumulated knowledge with transfer learning. In practice, this means we can apply a pre-trained LLM and fine-tune it with a relatively small dataset to allow the model to learn new customer-specific or use-case specific tasks. We are then able to scale up more effectively, it can be applied more easily for different use cases, leading to a higher ROI. For more information on how Altilia Intelligent Automation can support your organization to see radical improvements in accuracy and efficiency, schedule a demo here.

Read more

Leveraging GPT and Large Language Models to enhance Intelligent Document Processing

The rise of Artificial Intelligence has been the talk of the business world since the emergence of ChatGPT earlier this year. Now executives around the world find themselves in need of understanding the importance and power of Large Language Models in delivering potentially ground-breaking use cases that can bring greater efficiency and accuracy to mundane tasks. Natural Language Generation (NLG) enables computers to write a human language text response based on human generated prompts. What few understand is that there is still a deep flaw in the ChatGPT technology: up to 20-30% of all results have inaccuracies, according to Gartner. What Gartner have found is that ChatGPT is “susceptible to hallucinations and sometimes provides incorrect answers to prompts. It also reflects the deficiencies of its training corpus, which can lead to biased or inappropriate responses as well as algorithmic bias.” To better understand this, it’s key to consider how LLMs work: hundreds of billions of pieces of training data are fed into the model, enabling it to learn patterns, associations, and linguistic structures. This massive amount of data allows the model to capture a wide range of language patterns and generate responses based on its learned knowledge. However, as vast training data can be, the model can only generate responses as reliable as the information it has been exposed to. If it encounters a question or topic that falls outside the training data or knowledge cutoff, responses may be incomplete or inaccurate. For this reason, and to better understand how best to use LLMs in enterprise environments, Gartner outlined a set of AI Design Patterns and ranked them by difficulty of each implementation. We are delighted to share that Altilia Intelligent Automation already implements in its platform two of the most complex design patterns: LLM with Document Retrieval or Search This provides the potential to link LLMs with internal document databases, unlocking key insights from internal data with LLM capabilities This provides much more accurate and relevant information, reducing the potential for inaccuracies due to the ability to the use of retrieval. Fine-tuning LLM The LLM foundation model is fine-tuned using transfer learning with an enterprise’s own documents or particular training dataset, which updates the underlying LLM parameters. LLMs can then be customized to specific use cases, providing bespoke results and improved accuracy. So, while the business and technology world has been getting excited by the emergence of ChatGPT and LLMs, Altilia has already been providing tools to enterprises to leverage these generative AI models to their full potential. And by doing so, thanks to its model’s fine-tuning capabilities, we are able to overcome the main limitation of a system like OpenAI’s ChatGPT, which is the lack of accuracy of its answers. For more information on how Altilia Intelligent Automation can help your organization, schedule a free demo here.

Read more

How to use AI to discover the hidden meaning in complex documents

Welcome to our third blog of a series uncovering the key components of Artificial Intelligence to provide greater understanding for business leaders who may currently have FOMO (Fear Of Missing Out) from the blizzard of acronyms and hype. Here, we look at Computer Vision, one of the main applications of AI where computers can be made to gain high-level of understanding from digital images or videos. Critically, Computer Vision is concerned with automatic extraction of data, enabling documents that have handwriting and random layouts to become machine-readable. Huge data volumes Computer Vision needs a lot of data to be able to distinguish and recognize images. In a way, it looks like a jigsaw puzzle where you assemble all the scattered tiles to make an image. Neural networks for CV work on the same principle. Yet the computer does not have the final image, but it is fed hundreds, if not thousands of related images that train it to recognize specific objects. To identify a cat, the computer would not be shown individual elements such as ears, whiskers, tail etc, but millions of pictures of cats so that it can model the features of our feline friends. CV is used for visual surveillance, medical image processing for patient diagnosis and navigation by autonomous vehicles. But in Altilia’s development of Intelligent Document Processing (IDP), CV has several key roles to play. With Optical Character Recognition (OCR) and Intelligent Character Recognition (ICR), we are able to convert scanned documents into machine-readable PDFs and with Handwritten Text Recognition (HTR) are incorporate items such as signatures. End goal The end goal of an IDP solution is to extract meaningful information that are “hidden” in unstructured texts and documents, so we need to first break words down in a way that a machine can understand. This is especially relevant when the documents that need to be processed are (low quality) scans such as contracts, forms, invoices or ID cards. We then need to apply OCR to recognize both printed and handwritten text, using smaller units called tokens. To each token is added metadata, which is useful later in a search engine. In IDP, it is useful to distinguish a photo from text and to tag elements such as signatures, stamps and markings, saving human labor time by automating checks such as whether a contract is signed and marked. Finally, we focus on document layout analysis so that unsorted documents can be classified and then we can apply different machine learning algorithms and branch out different ML pipelines. These core capabilities allow Altilia’s solution to work as a general purpose platform, rather than a point solution for specific document types and formats. We have also developed a patented solution for document layout analysis. For more information on how Altilia Intelligent Automation can help your organization, schedule a free demo here.

Read more