cs.LG updates on arXiv.org

↧

Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models

February 12, 2024, 9:00 pm

Large Language Models (LLMs) based on Mixture-of-Experts (MoE) architecture are showing promising performance on various tasks. However, running them on resource-constrained settings, where GPU memory...

View Article

Informativeness of Reward Functions in Reinforcement Learning

February 12, 2024, 9:00 pm

Reward functions are central in specifying the task we want a reinforcement learning agent to perform. Given a task and desired optimal behavior, we study the problem of designing informative reward...

View Article

FedImpro: Measuring and Improving Client Update in Federated Learning

February 12, 2024, 9:00 pm

Federated Learning (FL) models often experience client drift caused by heterogeneous data, where the distribution of data differs across clients. To address this issue, advanced research primarily...

View Article

Clients Collaborate: Flexible Differentially Private Federated Learning with...

February 12, 2024, 9:00 pm

To defend against privacy leakage of user data, differential privacy is widely used in federated learning, but it is not free. The addition of noise randomly disrupts the semantic integrity of the...

View Article

Guided Sketch-Based Program Induction by Search Gradients

February 12, 2024, 9:00 pm

Many tasks can be easily solved using machine learning techniques. However, some tasks cannot readily be solved using statistical models, requiring a symbolic approach instead. Program induction is one...

View Article

Non-linear Fusion in Federated Learning: A Hypernetwork Approach to Federated...

February 12, 2024, 9:00 pm

Federated Learning (FL) has emerged as a promising paradigm in which multiple clients collaboratively train a shared global model while preserving data privacy. To create a robust and practicable FL...

View Article

In-Context Data Distillation with TabPFN

February 12, 2024, 9:00 pm

Foundation models have revolutionized tasks in computer vision and natural language processing. However, in the realm of tabular data, tree-based models like XGBoost continue to dominate. TabPFN, a...

View Article

Contextual Stochastic Vehicle Routing with Time Windows

February 12, 2024, 9:00 pm

We study the vehicle routing problem with time windows (VRPTW) and stochastic travel times, in which the decision-maker observes related contextual information, represented as feature variables, before...

View Article

DeepCover: Advancing RNN Test Coverage and Online Error Prediction using...

February 12, 2024, 9:00 pm

Recurrent neural networks (RNNs) have emerged as powerful tools for processing sequential data in various fields, including natural language processing and speech recognition. However, the lack of...

View Article

Tree Ensembles for Contextual Bandits

February 12, 2024, 9:00 pm

We propose a novel framework for contextual multi-armed bandits based on tree ensembles. Our framework integrates two widely used bandit methods, Upper Confidence Bound and Thompson Sampling, for both...

View Article

Training dynamics in Physics-Informed Neural Networks with feature mapping

February 12, 2024, 9:00 pm

Physics-Informed Neural Networks (PINNs) have emerged as an iconic machine learning approach for solving Partial Differential Equations (PDEs). Although its variants have achieved significant progress,...

View Article

OpenFedLLM: Training Large Language Models on Decentralized Private Data via...

February 12, 2024, 9:00 pm

Trained on massive publicly available data, large language models (LLMs) have demonstrated tremendous success across various fields. While more data contributes to better performance, a disconcerting...

View Article

Assessing Uncertainty Estimation Methods for 3D Image Segmentation under...

February 12, 2024, 9:00 pm

In recent years, machine learning has witnessed extensive adoption across various sectors, yet its application in medical image-based disease detection and diagnosis remains challenging due to...

View Article

Learning Attributed Graphlets: Predictive Graph Mining by Graphlets with...

February 12, 2024, 9:00 pm

The graph classification problem has been widely studied; however, achieving an interpretable model with high predictive performance remains a challenging issue. This paper proposes an interpretable...

View Article

Clustering Techniques Selection for a Hybrid Regression Model: A Case Study...

February 12, 2024, 9:00 pm

This work addresses the performance comparison between four clustering techniques with the objective of achieving strong hybrid models in supervised learning tasks. A real dataset from a bio-climatic...

View Article

Generating Chain-of-Thoughts with a Direct Pairwise-Comparison Approach to...

February 12, 2024, 9:00 pm

To improve the ability of the large language model (LLMs) to handle complex reasoning problems, chain-of-thoughts (CoT) methods were proposed to guide LLMs to reason step-by-step, facilitating problem...

View Article

Solving Deep Reinforcement Learning Benchmarks with Linear Policy Networks

February 12, 2024, 9:00 pm

Although Deep Reinforcement Learning (DRL) methods can learn effective policies for challenging problems such as Atari games and robotics tasks, algorithms are complex and training times are often...

View Article

Topological Neural Networks: Mitigating the Bottlenecks of Graph Neural...

February 12, 2024, 9:00 pm

The irreducible complexity of natural phenomena has led Graph Neural Networks to be employed as a standard model to perform representation learning tasks on graph-structured data. While their capacity...

View Article

Understanding Test-Time Augmentation

February 12, 2024, 9:00 pm

Test-Time Augmentation (TTA) is a very powerful heuristic that takes advantage of data augmentation during testing to produce averaged output. Despite the experimental effectiveness of TTA, there is...

View Article

Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

February 12, 2024, 9:00 pm

Bilevel optimization has been recently applied to many machine learning tasks. However, their applications have been restricted to the supervised learning setting, where static objective functions with...

View Article