MCCHE Precision Convergence Webinar Series with Barend Mons

Event

Thursday, October 31, 2024, 11:00 to 13:00

STOP SHARING DATA: Visiting Algorithms, Swarm Learning and Next Generation FAIR (Federated AI Ready) Principles and Practice

By Barend Mons

Leiden University

Date: October 31, 2024
Time: 11:00am to 1:00pm

Abstract

The rapid developments in the field of machine learning have also brought along some existential challenges, which are in essence all related to the broad concept of ‘trust’. Aspects of this broad concept include trust in the output of any ML process (and the prevention of black boxes, hallucinations and so forth). The very trust in science is at stake, especially now that LLMs can generate ‘good-looking nonsense’ and paper mills come up in response to the perverse reward systems in current research environments. The other side of the same coin is that ML, if not properly controlled, will also break through security and privacy barriers and violate GDPR and other Ethical, Legal and Societal barriers, including equitability. In addition, the existence of data ‘somewhere’ by no means automatically implies its actual Reusability. This includes the by now well-established four elements of the FAIR principles: much data is not even Findable; if found, it is not Accessible under well-defined conditions; and if accessed, it is not Interoperable (understandable by third parties and machines). As a result, the vast majority of data and information is not Reusable without violation of copyrights, privacy regulations or the basic conceptual models that implicitly or explicitly underpin the query or the deep learning algorithm. Now that more and more data will also be ‘independently’ used by machines, all these challenges will be severely aggravated. This keynote will address how ‘data visiting’, as opposed to classical ‘data sharing’ (which carries the connotation of data downloads, transport and losing control), mitigates most, if not all, of the unwanted side effects of classical ‘data sharing’.
For federated data visiting, the data should be FAIR in an additional sense: they should be ‘Federated, AI-Ready’, so that visiting algorithms can answer questions related to Access Control, Consent and Format, and can read rich (FAIR) metadata about the data itself to determine whether they are ‘fit for purpose’ and machine actionable (i.e. FAIR Digital Objects, or Machine Actionable Units). The ‘fitness for purpose’ concept goes way beyond (but includes) information about methods, quality, error bars etc. The ‘immutable logging’ of all operations of visiting algorithms is crucial, especially when self-learning algorithms in ‘swarm learning’ are being used. Enough to keep us busy for a while.
