cs.LG updates on arXiv.org

↧

An Inexact Halpern Iteration with Application to Distributionally Robust...

February 12, 2024, 9:00 pm

The Halpern iteration for solving monotone inclusion problems has gained increasing interests in recent years due to its simple form and appealing convergence properties. In this paper, we investigate...

View Article

EmojiCrypt: Prompt Encryption for Secure Communication with Large Language...

February 12, 2024, 9:00 pm

Cloud-based large language models (LLMs) such as ChatGPT have increasingly become integral to daily operations, serving as vital tools across various applications. While these models offer substantial...

View Article

Using YOLO v7 to Detect Kidney in Magnetic Resonance Imaging

February 12, 2024, 9:00 pm

Introduction This study explores the use of the latest You Only Look Once (YOLO V7) object detection method to enhance kidney detection in medical imaging by training and testing a modified YOLO V7 on...

View Article

BIKED++: A Multimodal Dataset of 1.4 Million Bicycle Image and Parametric CAD...

February 12, 2024, 9:00 pm

This paper introduces a public dataset of 1.4 million procedurally-generated bicycle designs represented parametrically, as JSON files, and as rasterized images. The dataset is created through the use...

View Article

Anatomically-Controllable Medical Image Generation with Segmentation-Guided...

February 12, 2024, 9:00 pm

Diffusion models have enabled remarkably high-quality medical image generation, which can help mitigate the expenses of acquiring and annotating new images by supplementing small or imbalanced...

View Article

RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback

February 12, 2024, 9:00 pm

Reward engineering has long been a challenge in Reinforcement Learning (RL) research, as it often requires extensive human effort and iterative processes of trial-and-error to design effective reward...

View Article

Test-Time Adaptation for Depth Completion

February 12, 2024, 9:00 pm

It is common to observe performance degradation when transferring models trained on some (source) datasets to target testing data due to a domain gap between them. Existing methods for bridging this...

View Article

BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text...

February 12, 2024, 9:00 pm

In this paper, we present a new embedding model, called M3-Embedding, which is distinguished for its versatility in Multi-Linguality, Multi-Functionality, and Multi-Granularity. It can support more...

View Article

Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game...

February 12, 2024, 9:00 pm

A recent theory shows that a multi-player decentralized partially observable Markov decision process can be transformed into an equivalent single-player game, enabling the application of...

View Article

On the Exploitation of DCT-Traces in the Generative-AI Domain

February 12, 2024, 9:00 pm

Deepfakes represent one of the toughest challenges in the world of Cybersecurity and Digital Forensics, especially considering the high-quality results obtained with recent generative AI-based...

View Article

Quality and Trust in LLM-generated Code

February 12, 2024, 9:00 pm

Machine learning models are widely used but can also often be wrong. Users would benefit from a reliable indication of whether a given output from a given model should be trusted, so a rational...

View Article

Prompting Large Language Models for Zero-Shot Clinical Prediction with...

February 12, 2024, 9:00 pm

The inherent complexity of structured longitudinal Electronic Health Records (EHR) data poses a significant challenge when integrated with Large Language Models (LLMs), which are traditionally tailored...

View Article

Skip \n: A Simple Method to Reduce Hallucination in Large Vision-Language Models

February 12, 2024, 9:00 pm

Recent advancements in large vision-language models (LVLMs) have demonstrated impressive capability in visual information understanding with human language. Despite these advances, LVLMs still face...

View Article

Reproducibility, energy efficiency and performance of pseudorandom number...

February 12, 2024, 9:00 pm

Pseudo-Random Number Generators (PRNGs) have become ubiquitous in machine learning technologies because they are interesting for numerous methods. The field of machine learning holds the potential for...

View Article

On the Semantics of LM Latent Space: A Vocabulary-defined Approach

February 12, 2024, 9:00 pm

Understanding the latent space of language models (LM) is crucial to refining their performance and interpretability. Existing analyses often fall short in providing disentangled (model-centric)...

View Article

Inverse analysis of granular flows using differentiable graph neural network...

February 12, 2024, 9:00 pm

Inverse problems in granular flows, such as landslides and debris flows, involve estimating material parameters or boundary conditions based on target runout profile. Traditional high-fidelity...

View Article

Vision Mamba: Efficient Visual Representation Learning with Bidirectional...

February 12, 2024, 9:00 pm

Recently the state space models (SSMs) with efficient hardware-aware designs, i.e., the Mamba deep learning model, have shown great potential for long sequence modeling. Meanwhile building efficient...

View Article

Augmenting Math Word Problems via Iterative Question Composing

February 12, 2024, 9:00 pm

Despite the advancements in large language models (LLMs) for mathematical reasoning, solving competition-level math problems remains a significant challenge, especially for open-source LLMs without...

View Article

Keep or toss? A nonparametric score to evaluate solutions for noisy ICA

February 12, 2024, 9:00 pm

Independent Component Analysis (ICA) was introduced in the 1980's as a model for Blind Source Separation (BSS), which refers to the process of recovering the sources underlying a mixture of signals,...

View Article

Scalable network reconstruction in subquadratic time

February 12, 2024, 9:00 pm

Network reconstruction consists in determining the unobserved pairwise couplings between $N$ nodes given only observational data on the resulting behavior that is conditioned on those couplings --...

View Article