cs.LG updates on arXiv.org

Measurement Scheduling for ICU Patients with Offline Reinforcement Learning

February 12, 2024, 9:00 pm

Scheduling laboratory tests for ICU patients presents a significant challenge. Studies show that 20-40% of lab tests ordered in the ICU are redundant and could be eliminated without compromising...

Random Geometric Graph Alignment with Graph Neural Networks

February 12, 2024, 9:00 pm

We characterize the performance of graph neural networks for graph alignment problems in the presence of vertex feature information. More specifically, given two graphs that are independent...

Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs

February 12, 2024, 9:00 pm

How do transformer-based large language models (LLMs) store and retrieve knowledge? We focus on the most basic form of this task -- factual recall, where the model is tasked with explicitly surfacing...

ODIN: Disentangled Reward Mitigates Hacking in RLHF

February 12, 2024, 9:00 pm

In this work, we study the issue of reward hacking on the response length, a challenge emerging in Reinforcement Learning from Human Feedback (RLHF) on LLMs. A well-formatted, verbose but less helpful...

A Theoretical Analysis of Nash Learning from Human Feedback under General...

February 12, 2024, 9:00 pm

Reinforcement Learning from Human Feedback (RLHF) learns from the preference signal provided by a probabilistic preference model, which takes a prompt and two responses as input, and produces a score...

HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node...

February 12, 2024, 9:00 pm

Hypergraphs are marked by complex topology, expressing higher-order interactions among multiple entities with hyperedges. Lately, hypergraph-based deep learning methods to learn informative data...

Training Heterogeneous Client Models using Knowledge Distillation in...

February 12, 2024, 9:00 pm

Federated Learning (FL) is an emerging machine learning paradigm that enables the collaborative training of a shared global model across distributed clients while keeping the data decentralized. Recent...

Power Transformer Fault Prediction Based on Knowledge Graphs

February 12, 2024, 9:00 pm

In this paper, we address the challenge of learning with limited fault data for power transformers. Traditional operation and maintenance tools lack effective predictive capabilities for potential...

Can Tree Based Approaches Surpass Deep Learning in Anomaly Detection? A...

February 12, 2024, 9:00 pm

Detection of anomalous situations for complex mission-critical systems holds paramount importance when their service continuity needs to be ensured. A major challenge in detecting anomalies from the...

Physics-Informed Neural Networks with Hard Linear Equality Constraints

February 12, 2024, 9:00 pm

Surrogate modeling is used to replace computationally expensive simulations. Neural networks have been widely applied as surrogate models that enable efficient evaluations over complex physical...

DIMON: Learning Solution Operators of Partial Differential Equations on a...

February 12, 2024, 9:00 pm

The solution of a PDE over varying initial/boundary conditions on multiple domains is needed in a wide variety of applications, but it is computationally expensive if the solution is computed de novo...

The Impact of Domain Knowledge and Multi-Modality on Intelligent Molecular...

February 12, 2024, 9:00 pm

The precise prediction of molecular properties is essential for advancements in drug development, particularly in virtual screening and compound optimization. The recent introduction of numerous deep...

Depth Separations in Neural Networks: Separating the Dimension from the Accuracy

February 12, 2024, 9:00 pm

We prove an exponential separation between depth 2 and depth 3 neural networks, when approximating an $\mathcal{O}(1)$-Lipschitz target function to constant accuracy, with respect to a distribution...

Towards Generalized Inverse Reinforcement Learning

February 12, 2024, 9:00 pm

This paper studies generalized inverse reinforcement learning (GIRL) in Markov decision processes (MDPs), that is, the problem of learning the basic components of an MDP given observed behavior...

GenSTL: General Sparse Trajectory Learning via Auto-regressive Generation of...

February 12, 2024, 9:00 pm

Trajectories are sequences of timestamped location samples. In sparse trajectories, the locations are sampled infrequently; and while such trajectories are prevalent in real-world settings, they are...

Rethinking Graph Masked Autoencoders through Alignment and Uniformity

February 12, 2024, 9:00 pm

Self-supervised learning on graphs can be bifurcated into contrastive and generative methods. Contrastive methods, also known as graph contrastive learning (GCL), have dominated graph self-supervised...

Towards Fast Stochastic Sampling in Diffusion Generative Models

February 12, 2024, 9:00 pm

Diffusion models suffer from slow sample generation at inference time. Despite recent efforts, improving the sampling efficiency of stochastic samplers for diffusion models remains a promising...

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement...

February 12, 2024, 9:00 pm

In this paper, we prove that Distributional Reinforcement Learning (DistRL), which learns the return distribution, can obtain second-order bounds in both online and offline RL in general settings with...

The Implicit Bias of Gradient Noise: A Symmetry Perspective

February 12, 2024, 9:00 pm

We characterize the learning dynamics of stochastic gradient descent (SGD) when continuous symmetry exists in the loss function, where the divergence between SGD and gradient descent is dramatic. We...

GSINA: Improving Subgraph Extraction for Graph Invariant Learning via Graph...

February 12, 2024, 9:00 pm

Graph invariant learning (GIL) has been an effective approach to discovering the invariant relationships between graph data and its labels for different graph learning tasks under various distribution...
