Quantcast
Channel: cs.LG updates on arXiv.org
Browsing all 448 articles
Browse latest View live
↧

TurboSVM-FL: Boosting Federated Learning through SVM Aggregation for Lazy...

Federated learning is a distributed collaborative machine learning paradigm that has gained strong momentum in recent years. In federated learning, a central server periodically coordinates models with...

View Article


Noninvasive Acute Compartment Syndrome Diagnosis Using Random Forest Machine...

Acute compartment syndrome (ACS) is an orthopedic emergency, caused by elevated pressure within a muscle compartment, that leads to permanent tissue damage and eventually death. Diagnosis of ACS relies...

View Article


Preparing Lessons for Progressive Training on Language Models

The rapid progress of Transformers in artificial intelligence has come at the cost of increased resource consumption and greenhouse gas emissions due to growing model sizes. Prior work suggests using...

View Article

X Hacking: The Threat of Misguided AutoML

Explainable AI (XAI) and interpretable machine learning methods help to build trust in model predictions and derived insights, yet also present a perverse incentive for analysts to manipulate XAI...

View Article

Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation...

The conventional use of the Retrieval-Augmented Generation (RAG) architecture has proven effective for retrieving information from diverse documents. However, challenges arise in handling complex table...

View Article


Kernel-U-Net: Symmetric and Hierarchical Architecture for Multivariate Time...

Time series forecasting task predicts future trends based on historical information. Transformer-based U-Net architectures, despite their success in medical image segmentation, have limitations in both...

View Article

On Learning for Ambiguous Chance Constrained Problems

We study chance constrained optimization problems $\min_x f(x)$ s.t. $P(\left\{ \theta: g(x,\theta)\le 0 \right\})\ge 1-\epsilon$ where $\epsilon\in (0,1)$ is the violation probability, when the...

View Article

In-Context Reinforcement Learning for Variable Action Spaces

Recently, it has been shown that transformers pre-trained on diverse datasets with multi-episode contexts can generalize to new reinforcement learning tasks in-context. A key limitation of previously...

View Article


Parameterized Projected Bellman Operator

Approximate value iteration (AVI) is a family of algorithms for reinforcement learning (RL) that aims to obtain an approximation of the optimal value function. Generally, AVI algorithms implement an...

View Article


FedSSA: Semantic Similarity-based Aggregation for Efficient...

Federated learning (FL) is a privacy-preserving collaboratively machine learning paradigm. Traditional FL requires all data owners (a.k.a. FL clients) to train the same local model. This design is not...

View Article

Learning the Causal Structure of Networked Dynamical Systems under Latent...

This paper considers learning the hidden causal network of a linear networked dynamical system (NDS) from the time series data at some of its nodes -- partial observability. The dynamics of the NDS are...

View Article

What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity...

Polysemantic neurons -- neurons that activate for a set of unrelated features -- have been seen as a significant obstacle towards interpretability of task-optimized deep networks, with implications for...

View Article

Class Distribution Shifts in Zero-Shot Learning: Learning Robust Representations

Class distribution shifts are particularly challenging for zero-shot classifiers, which rely on representations learned from training classes but are deployed on new, unseen ones. Common causes for...

View Article


On robust overfitting: adversarial training induced distribution matters

Adversarial training may be regarded as standard training with a modified loss function. But its generalization error appears much larger than standard training under standard loss. This phenomenon,...

View Article

Deciphering and integrating invariants for neural operator learning with...

Neural operators have been explored as surrogate models for simulating physical systems to overcome the limitations of traditional partial differential equation (PDE) solvers. However, most existing...

View Article


Linear Log-Normal Attention with Unbiased Concentration

Transformer models have achieved remarkable results in a wide range of applications. However, their scalability is hampered by the quadratic time and memory complexity of the self-attention mechanism...

View Article

Efficient Reinforcement Learning from Partial Observability

In most real-world reinforcement learning applications, state information is only partially observable, which breaks the Markov decision process assumption and leads to inferior performance for...

View Article


DeliverAI: Reinforcement Learning Based Distributed Path-Sharing Network for...

Delivery of items from the producer to the consumer has experienced significant growth over the past decade and has been greatly fueled by the recent pandemic. Amazon Fresh, Shopify, UberEats,...

View Article

Improving Robustness via Tilted Exponential Layer: A Communication-Theoretic...

State-of-the-art techniques for enhancing robustness of deep networks mostly rely on empirical risk minimization with suitable data augmentation. In this paper, we propose a complementary approach...

View Article

COSTAR: Improved Temporal Counterfactual Estimation with Self-Supervised...

Estimation of temporal counterfactual outcomes from observed history is crucial for decision-making in many domains such as healthcare and e-commerce, particularly when randomized controlled trials...

View Article
Browsing all 448 articles
Browse latest View live