cs.LG updates on arXiv.org

↧

Diffusion Model-based Probabilistic Downscaling for 180-year East Asian...

February 12, 2024, 9:00 pm

As our planet is entering into the "global boiling" era, understanding regional climate change becomes imperative. Effective downscaling methods that provide localized insights are crucial for this...

View Article

From GARCH to Neural Network for Volatility Forecast

February 12, 2024, 9:00 pm

Volatility, as a measure of uncertainty, plays a crucial role in numerous financial activities such as risk management. The Econometrics and Machine Learning communities have developed two distinct...

View Article

Transformers with Attentive Federated Aggregation for Time Series Stock...

February 12, 2024, 9:00 pm

Recent innovations in transformers have shown their superior performance in natural language processing (NLP) and computer vision (CV). The ability to capture long-range dependencies and interactions...

View Article

Large (and Deep) Factor Models

February 12, 2024, 9:00 pm

We open up the black box behind Deep Learning for portfolio optimization and prove that a sufficiently wide and arbitrarily deep neural network (DNN) trained to maximize the Sharpe ratio of the...

View Article

SocraSynth: Multi-LLM Reasoning with Conditional Statistics

February 12, 2024, 9:00 pm

Large language models (LLMs), while promising, face criticisms for biases, hallucinations, and a lack of reasoning capability. This paper introduces SocraSynth, a multi-LLM agent reasoning platform...

View Article

MDGNN: Multi-Relational Dynamic Graph Neural Network for Comprehensive and...

February 12, 2024, 9:00 pm

The stock market is a crucial component of the financial system, but predicting the movement of stock prices is challenging due to the dynamic and intricate relations arising from various aspects such...

View Article

FAST: Factorizable Attention for Speeding up Transformers

February 12, 2024, 9:00 pm

Motivated by the factorization inherent in the original fast multipole method and the improved fast Gauss transform we introduce a factorable form of attention that operates efficiently in high...

View Article

Policy Improvement using Language Feedback Models

February 12, 2024, 9:00 pm

We introduce Language Feedback Models (LFMs) that identify desirable behaviour - actions that help achieve tasks specified in the instruction - for imitation learning in instruction following. To train...

View Article

Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation...

February 12, 2024, 9:00 pm

In modern machine learning, models can often fit training data in numerous ways, some of which perform well on unseen (test) data, while others do not. Remarkably, in such cases gradient descent...

View Article

Scaling Laws for Fine-Grained Mixture of Experts

February 12, 2024, 9:00 pm

Mixture of Experts (MoE) models have emerged as a primary solution for reducing the computational cost of Large Language Models. In this work, we analyze their scaling properties, incorporating an...

View Article

Nesting Particle Filters for Experimental Design in Dynamical Systems

February 12, 2024, 9:00 pm

In this paper, we propose a novel approach to Bayesian Experimental Design (BED) for non-exchangeable data that formulates it as risk-sensitive policy optimization. We develop the Inside-Out SMC^2...

View Article

Multiscale Neuroimaging Features for the Identification of Medication Class...

February 12, 2024, 9:00 pm

In the clinical treatment of mood disorders, the complex behavioral symptoms presented by patients and variability of patient response to particular medication classes can create difficulties in...

View Article

Comparing skill of historical rainfall data based monsoon rainfall prediction...

February 12, 2024, 9:00 pm

In this draft we consider the problem of forecasting rainfall across India during the four monsoon months, one day as well as three days in advance. We train neural networks using historical daily...

View Article

Generative Modeling of Discrete Joint Distributions by E-Geodesic Flow...

February 12, 2024, 9:00 pm

This paper introduces a novel generative model for discrete distributions based on continuous normalizing flows on the submanifold of factorizing discrete measures. Integration of the flow gradually...

View Article

An Investigation into Using Unsupervised Metrics to Optimise GNNs for Node...

February 12, 2024, 9:00 pm

Graph Neural Networks (GNNs) can be trained to detect communities within a graph by learning from the duality of feature and connectivity information. Currently, the common approach for optimisation of...

View Article

Generalizing across Temporal Domains with Koopman Operators

February 12, 2024, 9:00 pm

In the field of domain generalization, the task of constructing a predictive model capable of generalizing to a target domain without access to target data remains challenging. This problem becomes...

View Article

On Computationally Efficient Multi-Class Calibration

February 12, 2024, 9:00 pm

Consider a multi-class labelling problem, where the labels can take values in $[k]$, and a predictor predicts a distribution over the labels. In this work, we study the following foundational question:...

View Article

Differentially Private Zeroth-Order Methods for Scalable Large Language Model...

February 12, 2024, 9:00 pm

Finetuning on task-specific datasets is a widely-embraced paradigm of harnessing the powerful capability of pretrained LLMs for various downstream tasks. Due to the popularity of LLMs finetuning and...

View Article

Sourcerer: Sample-based Maximum Entropy Source Distribution Estimation

February 12, 2024, 9:00 pm

Scientific modeling applications often require estimating a distribution of parameters consistent with a dataset of observations - an inference task also known as source distribution estimation. This...

View Article

Empowering Federated Learning for Massive Models with NVIDIA FLARE

February 12, 2024, 9:00 pm

In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. Most state-of-the-art machine...

View Article