cs.LG updates on arXiv.org

↧

Reinforcement Unlearning

February 12, 2024, 9:00 pm

Machine unlearning refers to the process of mitigating the influence of specific training data on machine learning models based on removal requests from data owners. However, one important area that...

View Article

NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models...

February 12, 2024, 9:00 pm

Complex reasoning ability is one of the most important features of current LLMs, which has also been leveraged to play an integral role in complex decision-making tasks. Therefore, the investigation...

View Article

Beyond mirkwood: Enhancing SED Modeling with Conformal Predictions

February 12, 2024, 9:00 pm

Traditional spectral energy distribution (SED) fitting techniques face uncertainties due to assumptions in star formation histories and dust attenuation curves. We propose an advanced machine...

View Article

Fast, Scalable, Warm-Start Semidefinite Programming with Spectral Bundling...

February 12, 2024, 9:00 pm

While semidefinite programming (SDP) has traditionally been limited to moderate-sized problems, recent algorithms augmented with matrix sketching techniques have enabled solving larger SDPs. However,...

View Article

Fast sampling from constrained spaces using the Metropolis-adjusted Mirror...

February 12, 2024, 9:00 pm

We propose a new method called the Metropolis-adjusted Mirror Langevin algorithm for approximate sampling from distributions whose support is a compact and convex set. This algorithm adds an...

View Article

Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large...

February 12, 2024, 9:00 pm

In this paper, we demonstrate that an inherent waveform pattern in the attention allocation of large language models (LLMs) significantly affects their performance in tasks demanding a high degree of...

View Article

MultiGPrompt for Multi-Task Pre-Training and Prompting on Graphs

February 12, 2024, 9:00 pm

Graphs can inherently model interconnected objects on the Web, thereby facilitating a series of Web applications, such as web analyzing and content recommendation. Recently, Graph Neural Networks...

View Article

Is Inverse Reinforcement Learning Harder than Standard Reinforcement...

February 12, 2024, 9:00 pm

Inverse Reinforcement Learning (IRL) -- the problem of learning reward functions from demonstrations of an \emph{expert policy} -- plays a critical role in developing intelligent systems. While widely...

View Article

Detection of developmental language disorder in Cypriot Greek children using...

February 12, 2024, 9:00 pm

Children with developmental language disorder (DLD) encounter difficulties in acquiring various language structures. Early identification and intervention are crucial to prevent negative long-term...

View Article

Universal Jailbreak Backdoors from Poisoned Human Feedback

February 12, 2024, 9:00 pm

Reinforcement Learning from Human Feedback (RLHF) is used to align large language models to produce helpful and harmless responses. Yet, prior work showed these models can be jailbroken by finding...

View Article

Exploring the impact of social stress on the adaptive dynamics of COVID-19:...

February 12, 2024, 9:00 pm

In the context of natural disasters, human responses inevitably intertwine with natural factors. The COVID-19 pandemic, as a significant stress factor, has brought to light profound variations among...

View Article

Discovering Effective Policies for Land-Use Planning

February 12, 2024, 9:00 pm

How areas of land are allocated for different uses, such as forests, urban areas, and agriculture, has a large effect on the terrestrial carbon balance, and therefore climate change. Based on available...

View Article

On Measuring Faithfulness or Self-consistency of Natural Language Explanations

February 12, 2024, 9:00 pm

Large language models (LLMs) can explain their predictions through post-hoc or Chain-of-Thought (CoT) explanations. But an LLM could make up reasonably sounding explanations that are unfaithful to its...

View Article

Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large...

February 12, 2024, 9:00 pm

We propose the Data Contamination Quiz (DCQ), a simple and effective approach to detect data contamination in large language models (LLMs) and estimate the amount of it. Specifically, we frame data...

View Article

Kernel-, mean- and noise-marginalised Gaussian processes for exoplanet...

February 12, 2024, 9:00 pm

Using a fully Bayesian approach, Gaussian Process regression is extended to include marginalisation over the kernel choice and kernel hyperparameters. In addition, Bayesian model comparison via the...

View Article

Graph Neural Networks for Road Safety Modeling: Datasets and Evaluations for...

February 12, 2024, 9:00 pm

We consider the problem of traffic accident analysis on a road network based on road network connections and traffic volume. Previous works have designed various deep-learning methods using historical...

View Article

SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL...

February 12, 2024, 9:00 pm

In recent years, there has been growing interest in text-to-SQL translation, which is the task of converting natural language questions into executable SQL queries. This technology is important for its...

View Article

Denoising Heat-inspired Diffusion with Insulators for Collision Free Motion...

February 12, 2024, 9:00 pm

Diffusion models have risen as a powerful tool in robotics due to their flexibility and multi-modality. While some of these methods effectively address complex problems, they often depend heavily on...

View Article

Learning optimal integration of spatial and temporal information in noisy...

February 12, 2024, 9:00 pm

We investigate the boundary between chemotaxis driven by spatial estimation of gradients and chemotaxis driven by temporal estimation. While it is well known that spatial chemotaxis becomes...

View Article

ByteStack-ID: Integrated Stacked Model Leveraging Payload Byte Frequency for...

February 12, 2024, 9:00 pm

In the ever-evolving realm of network security, the swift and accurate identification of diverse attack classes within network traffic is of paramount importance. This paper introduces "ByteStack-ID,"...

View Article