Artificial Intelligence or Artificial Stupidity?
The concept of artificial intelligence (AI) can be traced back to pioneering researchers such as Charles Babbage and Ada Lovelace, who developed early mechanical computers, laying the groundwork for what was to follow. Later, Alan Turing, followed by Isaac Asimov and Gene Roddenberry brought the concept of AI into the public consciousness. This combination of academic research alongside the cultural influence of Star Trek has given us two generations of researchers who realized that AI need not be confined to a sci-fi show or an academic paper but could be developed in reality.
Early attempts to develop AI have persisted since the 1940s and the advent of the first electromechanical computers, with developments gathering pace as the first microprocessor-based computers were introduced. It is only in very recent times that AI has truly become a genuine possibility—the result of many different strands finally converging—that truly meaningful AI has arrived. What has proved truly transformational in recent years is not processing power alone, but access to training material on an unprecedented scale. Rapid advances in internet connectivity have unlocked vast repositories of human knowledge—books, magazines, academic journals, and archives spanning population, geographical, and financial data—giving AI systems the raw material needed to develop genuine capability.
For most AI engines, such as those which can be used to generate images, this training material has proved enormously effective. With access to the vast databases of images, artwork, and films, they are highly capable, and have inherent truth built into what they’re being trained with. A Picasso is a Picasso; a photograph of Donald Trump is what it purports to be. There is little inherent uncertainty in the training data, and the outputs reflect that reliability, whether the task is generating artwork in a particular style or producing entirely novel imagery.
That is not the case when it comes to science; we’re training AI engines, including those used to analyze our chromatography and mass spectrometry data, using datasets that are not “ground truth” or in many cases, remotely accurate. This is precisely where the concept of artificial stupidity takes hold.
Mass spectrometry-related AI presents an area of grave concern. There remains, to this day, a healthy discourse about analyte identification in high-resolution, untargeted datasets, with a substantial proportion of the community arguing that many discreet compounds identified in datasets are the result of in-source fragmentation.1,2 There also remains concerns around the appropriate handling of isotopic distributions, particularly in data from instruments that lack suitable isotopic fidelity, alongside similar concerns around correct adduct identification.3 This leads to a situation where the compound identifications, fragmentation patterns, isotopic distributions, and adduct species used to train AI-based mass spectrometry data processing tools are so fundamentally inaccurate that the resulting AI package generates more inaccurate compound identifications. This will in turn generate more data that is used to train, retrain, and refine models, amplifying underlying inaccuracy rather than producing increasingly more accurate data and embedding errors ever more deeply into the system.
This can be partially resolved by training AI models with data from targeted modelling, though even in this area, it may not be possible to provide the level of reliable, ground truth data that is really needed to enable accurate AI-based compound identification. In recent years, we have seen egregious errors in retention time and compound-peak annotation even in targeted data, once again introducing inaccury and errors in the datasets used to iterate successive generations of AI.4.5
As I write this, the US has placed export controls on some of the very latest AI technologies — so not only are we contending with inaccurate training materials giving rise to AI models that will produce inaccurate answers when challenged with mass spectrometry data, but we now have export control legislation limiting who can access which models. The obvious concern is that researchers in different locations will end up working with models that interpret the same datasets in different ways, further undermining whatever accuracy AI might otherwise bring to the analysis of complex mass spectrometry data.6
The good news is there’s nothing which can’t be fixed but there will need to be community consensus on how best to proceed. This is something which might best be progressed by the American Society for Mass Spectrometry (ASMS) and/or the Metabolomics Society. These two organisations run annual conferences which would be the perfect place for the community to come together, agree on some principles and perhaps establish the appropriate working groups that can best agree on the necessary convergence of different AI models and a suitable framework that can be used to ensure regardless of the AI models used, the same data going in will result in the same data coming out the other end.
We’ve seen the community come together with some nice perspective publications in the past and I feel this is a topic well suited to some similar treatment. 6
Related Stories
AI News
Suriya entering politics? actor's fan club issues clarification - INDIA
7 seconds ago
AI News
'One nation, one law is the demand across India': BJP President Samik Bhattacharya on West Bengal UCC Bill
11 seconds ago
AI News
US ‘light artist’ opens his most daring skyspace yet
16 seconds ago
AI News
Most of council
20 seconds ago
AI News
Shift observed on Trinidad’s Galfa shoreline following Venezuela earthquakes
22 seconds ago
AI News
Police in Kitchener warn fines will be issued if fireworks are set off in Victoria Park on Canada Day
24 seconds ago
AI News
New research suggests invasive spotted lanternfly may be able to survive Canadian winters
28 seconds ago
AI News
Maruti Suzuki onboards 5 startups to co
4 minutes ago