Advances in Financial AI

Advances in Financial AI:
Innovations, Risk, and Responsibility

in the Era of LLMs

Advances in Financial AI:
Innovations, Risk, and Responsibility

in the Era of LLMs

Advances in Financial AI:
Innovations, Risk, and Responsibility

in the Era of LLMs

Workshop at CIKM 2025

Registration

Workshop Abstract

The finance sector is seeing a rapid increase in the application of machine learning and AI, with Large Language Models (LLMs), ESG (Environmental, Social, and Governance) investing, and AI Safety significantly reshaping the field. This workshop focuses on how these advancements intersect with core financial AI applications. We will foster interdisciplinary discussion on applying LLMs to finance, addressing challenges in multilingual and non-English markets like Korea. The event will also highlight the integration of ESG signals into algorithmic decision-making and explore AI Safety, emphasizing reliability, fairness, and explainability for AI systems in regulated financial environments. By bringing together experts from academia, industry, and regulatory bodies, the workshop aims to stimulate discussions on practical issues, ethical dilemmas, and cutting-edge research shaping financial AI's future. We welcome submissions that combine technical rigor with societal relevance in AI-driven financial decisions.

Call for Papers

Submit

Workshop Theme and Topics

This workshop explores the evolving landscape of machine learning and AI applications in finance, with a particular focus on the emerging roles of Large Language Models (LLMs), responsible AI practices, and risk-aware decision-making. As financial institutions increasingly adopt AI technologies, the demand for interpretable, trustworthy, and domain-aligned models is higher than ever. This workshop aims to bridge technical advances with real-world constraints, regulatory expectations, and ethical considerations.

Topics of Interest include, but are not limited to:

LLMs for financial applications: fine-tuning, prompting, retrieval, and evaluation
Risk modeling and robust learning under distributional shifts in financial data
AI safety, fairness, and explainability in high-stakes financial environments
Multi-lingual and region-specific challenges in applying AI to finance
Responsible AI development and deployment in fintech, banking, and asset management

Workshop Theme and Topics

Topics of Interest include, but are not limited to:

LLMs for financial applications: fine-tuning, prompting, retrieval, and evaluation
Risk modeling and robust learning under distributional shifts in financial data
AI safety, fairness, and explainability in high-stakes financial environments
Multi-lingual and region-specific challenges in applying AI to finance
Responsible AI development and deployment in fintech, banking, and asset management

Workshop Theme and Topics

Topics of Interest include, but are not limited to:

LLMs for financial applications: fine-tuning, prompting, retrieval, and evaluation
Risk modeling and robust learning under distributional shifts in financial data
AI safety, fairness, and explainability in high-stakes financial environments
Multi-lingual and region-specific challenges in applying AI to finance
Responsible AI development and deployment in fintech, banking, and asset management

Submission
Guidelines

Submission Guidelines

Submission
Guidelines

We welcome original research contributions, as well as working papers that foster discussion among students, academic researchers, and industry professionals. All submissions must adhere to the CIKM formatting guidelines, available at: https://www.acm.org/publications/proceedings-template.

Long Papers: Up to 8 pages, excluding references and appendices.
Short Papers: Up to 4 pages, excluding references and appendices.

At least one author of each accepted paper must register for and attend the workshop in person to present their work.

Failure to do so will result in the withdrawal of the paper from the workshop.

Long Papers: Up to 8 pages, excluding references and appendices.
Short Papers: Up to 4 pages, excluding references and appendices.

At least one author of each accepted paper must register for and attend the workshop in person to present their work.

Failure to do so will result in the withdrawal of the paper from the proceedings.

Review and
Acceptance Process

Review and
Acceptance
Process

Submission System & Process

Submissions are to be made through the OpenReview platform. Authors must submit the paper content (PDF document), title, author names, contact details, and a brief abstract electronically through the workshop’s submission site. At least one author of each accepted paper is required to attend the conference to present their work.

Review Process

This workshop will follow a single-blind review process, where authors’ identities are known to the reviewers, but reviewers’ identities are not disclosed to the authors. There will be no rebuttal period, and all submissions will be treated with strict confidentiality.

Accepted Papers

All accepted papers will be invited to participate in the poster session. Participants are required to print and bring their own posters to the event. Detailed specifications on poster format and requirements will be provided by the organizers after acceptance.

Selected papers may also be given the opportunity for an oral presentation, subject to scheduling constraints.

Please note that the workshop is non-archival and will not have official proceedings. Only the author names and titles of accepted papers will appear on the website, with no disclosure of any content.

Submission System & Process

Review Process

Accepted Papers

Selected papers may also be given the opportunity for an oral presentation, subject to scheduling constraints.

Submission System

Review Process

Accepted Papers

Selected papers may also be given the opportunity for an oral presentation, subject to scheduling constraints.

Schedule

Opening Remarks

9:00 AM - 9:10 AM

Yongjae Lee

UNIST

Keynote #1

9:10 AM - 9:50 AM

LLMs in Financial Markets: Forecasting Signals, Agent-Based Trading, and Memorization Risks

Alejandro Lopez-Lira

University of Florida

Abstract: This talk synthesizes evidence on what large language models (LLMs) can—and cannot—do for investment practice today.

(i) Forecasting from text: Building on “Can ChatGPT Forecast Stock Price Movements?”, I show that off-the-shelf LLMs extract tradable signals from news headlines, with out-of-sample return predictability that is strongest for smaller firms and after negative news. I’ll discuss data/engineering choices practitioners care about (timing alignment, transaction costs, and model governance).

(ii) LLM trading agents: From “Can Large Language Models Trade?”, I present a framework that prompts LLMs to act as coherent trading agents (e.g., value, momentum, market making) and interact in simulated markets. We examine when these agents add liquidity or amplify volatility, how strategy design affects stability, and what this implies for deployment and risk controls.

(iii) Memorization and evaluation risk: Drawing on “The Memorization Problem,” I demonstrate that LLMs can recall pre–knowledge-cutoff economic data with high fidelity, confounding historical “forecasting” tests. I’ll offer practical protocols to reduce leakage—post-cutoff evaluations, time-locked inference, masked/perturbed holdouts, and auditable data pipelines.

Keynote #2

9:50 AM - 10:30 AM

How To Choose A Threshold For An Evaluation Metric For Large Language Models

Dhagash Mehta

BlackRock

Abstract: To ensure and monitor large language models (LLMs) reliably, various evaluation metrics have been proposed in the literature. However, there is little research on prescribing a methodology to identify a robust threshold on these metrics even though there are many serious implications of an incorrect choice of the thresholds during deployment of the LLMs. Translating the traditional model risk management (MRM) guidelines within regulated industries such as the financial industry, in this talk, I will discuss a step-by-step recipe for picking a threshold for a given LLM evaluation metric. I will emphasize that such a methodology should start with identifying the risks of the LLM application under consideration and risk tolerance of the stakeholders. We then propose concrete and statistically rigorous procedures to determine a threshold for the given LLM evaluation metric using available ground-truth data.

Coffee Break

10:30 AM - 11:00 AM

Keynote #3

11:00 AM - 11:45 AM

New Frontiers of Generative AI in Finance: Structured Agentic Workflows and Foundational Time-Series Models

Hao Ni

UCL

Abstract: Building accurate and trustworthy models for financial time-series data remains a core challenge in quantitative finance. This talk presents two complementary research directions advancing the frontiers of generative AI in finance: structured agentic workflows and domain-adapted foundation models.

In the first part, I will introduce TS-Agent, a modular framework that integrates the systematic optimization of AutoML with the reasoning and adaptability of agentic AI. TS-Agent automates time-series modelling through iterative stages of model selection, code refinement, and fine-tuning, guided by curated knowledge banks. Combining the strengths of both paradigms, it enables adaptive, transparent, and auditable model development, consistently outperforming strong AutoML and agentic baselines across diverse forecasting and generative tasks.

In the second part, I will present our empirical study on time-series foundation models (TSFMs) on financial data. Using a comprehensive dataset of daily excess returns across global markets, we systematically evaluate zero-shot inference, fine-tuning, and pre-training from scratch. Pre-training on financial data and synthetic data augmentation yields substantial gains in predictive accuracy and portfolio performance, underscoring the promise of TSFM in finance.

Invited Talk #1

11:45 AM - 12:15 PM

Agents, Tools, and Truth in Finance

Nicole Cho

JP Morgan AI Research

Abstract: Large language models are rapidly shifting from single-shot predictors to agentic systems that plan, act, and reflect while invoking external tools. Yet many barriers keep LLM agents from becoming trustworthy systems in finance - including the difficulty in knowing when an agent will hallucinate, and how to orchestrate tools so the agent actually follows evidence. This 30-minute talk will focus on recent work presented at AAAI, NeuRIPs, EMNLP, COLING, ICAIF to drive closer to a unified playbook for building trustworthy LLM agents in finance. We will dive deeper into inverse reinforcement learning, hallucination prediction methods during inference, and grounding general models with inductive biases through the use of domain-specific tools.

Invited Talk #2

12:15 PM - 12:45 PM

Harnessing AI in Economic Statistics

Dr. Sojung Kim

Bank of Korea

Abstract: International organizations and central banks are increasingly exploring the use of artificial intelligence in economic statistics, leveraging high-frequency and alternative data for real-time analysis and policy decisions. This talk presents recent research by the Bank of Korea, focusing on the use of language models to analyze web-based and textual data sources.

The first part of the talk focuses on the News Sentiment Index, an experimental statistic derived from economic news. We introduce a methodology to filter economically meaningful articles and classify sentiment at both sentence and article levels, primarily through fine-tuning encoder-based models. We also discuss experiments using small LLMs with various prompting strategies for sentiment analysis.

The second part highlights an application of LLMs in official statistical research, measuring data intensity in labor markets to classify occupational text data. This study provides a systematic comparison between traditional rule-based NLP pipelines and LLM-based approaches.

Through these examples, the talk demonstrates how LLMs can complement conventional econometric methods, enabling more responsive, explainable, and scalable approaches to economic measurement while addressing remaining challenges.

Lunch Break

12:45 PM - 1:45 PM

Invited Talk #3

1:45 PM - 2:05 PM

AI Transformation in KFTC

Seonkyu Lim

KFTC

Abstract: This presentation introduces the AI-driven initiatives led by the Korea Financial Telecommunications and Clearings Institute (KFTC). Building on its nationwide payment infrastructure, KFTC is advancing graph analytics for anti-money laundering, fraudulent activity monitoring in Onnuri gift certificates, and representation learning tailored for financial fraud analysis. KFTC also contributes to the national World Best LLM Project by developing a financial domain benchmark and promoting a synthetic data ecosystem to accelerate innovation in financial AI.

Invited Talk #4

2:05 PM - 2:25 PM

Toward Reliable and Explainable Fiscal–Economic Generative AI

Yeonhee Lee

ETRI

Abstract: Recent advances in generative artificial intelligence have expanded efforts to apply such models to economic forecasting and narrative generation. While statistical generative AI shows promise across policy, finance, and industrial analysis, it still struggles to fully reflect the structural principles and real-world dynamics of economic systems. This talk introduces a new approach — Structural Generative AI — grounded in the concept of a Fiscal–Economic Digital Twin. The proposed framework embeds economic logic, policy mechanisms, and inter-sectoral interactions directly into the generative process, aiming to bridge the gap between statistical generative modeling and structurally consistent economic simulation. We explore the complementary potential of these two paradigms by comparing results from generative AI–based statistical models and digital-twin–based structural simulations under fiscal and macroeconomic scenarios. By combining structural consistency with generative flexibility, this research envisions a path toward sustainable, explainable, and trustworthy AI, ultimately evolving into an “AI Policy Assistant” that supports fiscal and economic decision-making.

Invited Talk #5

2:25 PM - 2:45 PM

Financial AI Data Services and Models at Korea Credit Information Services(KCIS)

Jongin Choi

K Credit

Abstract: As a comprehensive credit information agency in Korea, KCIS provides various financial AI services and models for financial institutions, the public sector, and academia. We provide models, datasets and services such as synthetic data generation models, persona databases and open datasets for AI learning. In this presentation, we introduce and share how to support research, innovation, and responsible AI in finance.

Invited Talk #6

2:45 PM - 3:15 PM

Foundation Model Challenges and Opportunities in Financial Services

Eric Jiawei He

RBC Borealis

Abstract: Financial services are at the core of our economy. Opportunities for machine learning abound in this space, from capital markets to insurance services to wealth management to lending to tools that assist clients in managing their money. Modern machine learning methods have transformed industries, yet particular challenges exist in realizing the full potential of machine learning in financial services. These include explainability, data imbalance, partial observations, distribution shift, and self-supervised learning in low-signal settings. I will describe the ATOM foundation model, which specializes in learning from asynchronous event sequences, to maximally utilize the richness of transactional data in financial services.

Coffee Break

3:15 PM - 4:00 PM

Accepted Papers

4:00 PM - 4:30 PM

4:00 PM -4:30 PM

Decision by Supervised Learning with Deep Ensembles: A Practical Framework for Robust Portfolio Optimization

Juhyeong Kim

Mirae Assets

Abstract: We propose Decision by Supervised Learning (DSL), a practical framework for robust portfolio optimization. DSL reframes portfolio construction as a supervised learning problem: models are trained to predict optimal portfolio weights, using cross-entropy loss and portfolios constructed by maximizing the Sharpe or Sortino ratio. To further enhance stability and reliability, DSL employs Deep Ensemble methods, substantially reducing variance in portfolio allocations. Through comprehensive backtesting across diverse market universes and neural architectures, shows superior performance compared to both traditional strategies and leading machine learning-based methods, including Prediction-Focused Learning and End-to-End Learning. We show that increasing the ensemble size leads to higher median returns and more stable risk-adjusted performance. The code is available at https://github.com/DSLwDE/DSLwDE.

NMIXX: Domain-Adapted Neural Embeddings for Cross-Lingual eXploration of Finance

Hanwool Lee

Shinhan Securities

Abstract: General-purpose sentence embedding models often struggle to capture specialized financial semantics—especially in low-resource languages like Korean—due to domain-specific jargon, temporal meaning shifts, and misaligned bilingual vocabularies. To address these gaps, we introduce NMIXX (Neural eMbeddings for Cross-lingual eXploration of Finance), a suite of cross-lingual embedding models fine-tuned with 18.8,K high-confidence triplets that pair in-domain paraphrases, hard negatives derived from a semantic-shift typology, and exact Korean-English translations. Concurrently, we release KorFinSTS, a 1,921-pair Korean financial STS benchmark spanning news, disclosures, research reports, and regulations, designed to expose nuances that general benchmarks miss. When evaluated against seven open-license baselines, NMIXX’s multilingual bge-m3 variant achieves Spearman’s rho gains of +0.10 on English FinSTS and +0.22 on KorFinSTS—outperforming its pre-adaptation checkpoint and surpassing other models by the largest margin—while revealing a modest trade-off in general STS performance. Our analysis further shows that models with richer Korean token coverage adapt more effectively, underscoring the importance of tokenizer design in low-resource, cross-lingual settings. By making both models and benchmark publicly available, we provide the community with robust tools for domain-adapted, multilingual representation learning in finance.

Invited Talk #7

4:30 PM - 5:00 PM

Engineering an AI-native Hedge Fund: From Data Infrastructure to Intelligent Agents

Joo Lee

Arrowpoint Investment Partners

Jin Kim

LinqAlpha

Invited Talk #8

5:00 PM - 5:20 PM

From Forecasting to Understanding: Explainable Financial AI in Real World Markets

Wonbin Ahn

LG AI Research

Abstract: Forecasting has always been at the heart of financial AI, but the next challenge lies in understanding why markets move. This talk introduces EXAONE Business Intelligence (EXAONE-BI), LG AI Research’s agentic framework for explainable forecasting powered by large language models. By combining time-series modeling with unstructured data such as news, filings, and analyst reports, EXAONE-BI integrates multi-modal reasoning and LLM-based explainability to generate both forecast signals and natural language interpretations that reveal the underlying drivers of market movements. We highlight its deployment in real financial ecosystems, including AI-managed ETFs (LQAI) and insight generation with the London Stock Exchange Group (LSEG), demonstrating how explainable AI can operate reliably in live market environments. The talk concludes with reflections on how large language models can connect data, reasoning, and decision-making to advance responsible and transparent financial intelligence.

Panel Discussion

5:20 PM - 6:00 PM

From Models to Markets: Deploying AI Responsibly in Financial Systems

Bhaskarjit Sarma

Domyn

Hye-young Yoon

KFTC

Eric Jiawei He

RBC Borealis

Sojung Kim

Bank of Korea

Joo Lee

Arrowpoint

Jin Kim

Moderator

Closing Remarks & Networking

6:00 PM -

6:00 PM

Yongjae Lee

UNIST

Accepted Papers

Accepted
Papers

Signed Bridge Pruning Framework : Improving Structural Reliability in Financial Knowledge Graphs

Jeongseon Kim, Yeonhee Lee, Sungsu Lim

A Double-Edged Sword: Benchmarking the Trade-off Between Bias Mitigation and Helpfulness of LLM Guardrails in Finance

Sungjun Lim, Hoyoon Byun, Jihee Kim, Kyungwoo Song

Multilingual Conversational AI for Financial Assistance: Bridging Language Barriers in Indian FinTech

Bharatdeep Hazarika, Arya Suneesh, Prasanna Devadiga, Pawan Kumar Rajpoot, Anshuman B Suresh, Ahmed Ifthaquar Hussain

LLM-Enhanced Black-Litterman Portfolio Optimization

Youngbin Lee, Yejin Kim, Juhyeong Kim, Suin Kim, Yongjae Lee

FPQ-VAE: A Dynamic Factor Model Fusing Financial Priors and Vector-Quantized Factors for Stock Rank Prediction

Namhyoung Kim, Janghyuk Youn, Jae Wook Song

FinNuE: Exposing the Risks of Using BERTScore for Numerical Semantic Evaluation in Finance

Yu-Shiang Huang, Yun-Yu Lee, Chou Tzu Hsin, Che Lin, Chuan-Ju Wang

Interpreting LLMs as Credit Risk Classifiers: Do Their Feature Explanations Align with Classical ML

Saeed Bark AlMarri, Kristof Juhasz, Mathieu Ravaut, Gautier Marti, Hamdan Ali Al Ahbabi, Ibrahim M. Elfadel

Decision by Supervised Learning with Deep Ensembles: A Practical Framework for Robust Portfolio Optimization

Juhyeong Kim, Sungyoon Choi, Youngbin Lee, Yejin Kim, Yongmin Choi, Yongjae Lee

NMIXX: Domain-Adapted Neural Embeddings for Cross-Lingual eXploration of Finance

Hanwool Lee, Yewon hwang, Jonghyun Choi, Heejae ahn, Sung Bum Jung, Youngjae Yu

Modeling Corporate Health Rating Using Online Business Reviews

Makanjuola Adekunmi Ogunleye, Nazanin Zaker, Xue Han, Jeremy Krohn

FINCH: Financial Intelligence using Natural language for Contextualized SQL Handling Avinash

Kumar Singh, Bhaskarjit Sarmah, Stefano Pasquali

Verified LRM Agents for Finance: Economics-Informed World Models and Abstract Knowledge Management for Safe and Responsible Decisioning

David Scott Lewis, Anar Batkhuu, Haley Yi, Enrique Zueco, Zijin Wu

VAP: Preventing Plausibility Traps in Algorithmic Trading with Physics-Informed Learning and Formal Verification

David Scott Lewis, Enrique Zueco

UIAGR:Up-to-date Infomation Awareness Generative Retrieval for Alipay Fund Search

Jingyuan Wen, Hong Liu, chenglei shen, Gang Yang, yedan shen, Kaixin Wu, mingjie zhong, Xu Jia, Linjian Mo

Towards Macroeconomic Policy Analysis with LLM-Agent Simulations

Taewan You

Earnings-Call Tone and Market Reactions: Evidence from Korea with a Text-Based Trading Strategy

Eunchong Kim, Jaehee Jang, NakYoung Lee

Collaborative Multi-Agent LLMs for Autonomous Portfolio Management: An Empirical Study of Distributed AI Decision-Making in Financial Markets

Quang-Vinh Dang, Ngoc-Son-An Nguyen

SLRL: Strategy-Level Reinforcement Learning for Robust Portfolio Management

Junyoung Park, SeungEun Ock, Yosep na, Young Hoon Choi, Jae Wook Song

GuruAgents: Emulating Wise Investors with Prompt-Guided LLM Agents

Yejin Kim, Youngbin Lee, Juhyeong Kim, Yongjae Lee

Knowledge Graph Construction for Stock Markets with LLM-Based Explainable Reasoning

Cheonsol Lee

Algorithmic Echo Chambers: Interpretation Cascades, Model Risk Personas, and Systemic Risk in LLM-Driven Financial Workflows

Anar Batkhuu, David Scott Lewis, Enrique Zueco, Haley Yi

FinAI Data Assistant: LLM-based Financial Database Query Processing with the OpenAI Function Calling API

Juhyeong Kim, Yejin Kim, Youngbin Lee, Hyunwoo Byun

Your AI, Not Your View: The Bias of LLMs in Investment Analysis

Hoyoung Lee, Junhyuk Seo, Suhwan Park, Junhyeong Lee, Wonbin Ahn, CHANYEOL CHOI, Alejandro Lopez-Lira, Yongjae Lee

Contact

For further details and submission guidelines, please contact

For further details and submission guidelines, please contact Prof. Yongjae Lee at yongjaeleee@gmail.com.

For further details and submission guidelines, please contact Prof. Yongjae Lee

advancesinfinancialai@gmail.com

Workshop Assistants

Hoyoung Lee

UNIST, South Korea

Katie Duke

LinqAlpha, USA

Juneha Hwang

LinqAlpha, USA

Jihoon Kwon

LinqAlpha, USA

Minjae Kim

LinqAlpha, USA

Workshop Abstract

Workshop Abstract

Call for Papers

Call for Papers

SubmissionGuidelines

Submission Guidelines

SubmissionGuidelines

Review and Acceptance Process

Review and Acceptance Process

Review and AcceptanceProcess

Schedule

Schedule

Accepted Papers

Accepted Papers

AcceptedPapers

Submission
Guidelines

Submission
Guidelines

Review and
Acceptance Process

Review and
Acceptance Process

Review and
Acceptance
Process

Accepted
Papers