Entropy-Regularized Token-Level Policy Optimization for Large Language Models
Large Language Models (LLMs) have shown promise as intelligent agents in interactive decision-making tasks. Traditional approaches often depend on meticulously designed prompts, high-quality examples,...
View ArticleFeed-Forward Neural Networks as a Mixed-Integer Program
Deep neural networks (DNNs) are widely studied in various applications. A DNN consists of layers of neurons that compute affine combinations, apply nonlinear operations, and produce corresponding...
View ArticleFL-NAS: Towards Fairness of NAS for Resource Constrained Devices via Large...
Neural Architecture Search (NAS) has become the de fecto tools in the industry in automating the design of deep neural networks for various applications, especially those driven by mobile and edge...
View ArticleScaling Intelligent Agents in Combat Simulations for Wargaming
Remaining competitive in future conflicts with technologically-advanced competitors requires us to accelerate our research and development in artificial intelligence (AI) for wargaming. More...
View ArticleComparison of machine learning and statistical approaches for digital...
Several methods have been proposed for correcting the elevation bias in digital elevation models (DEMs) for example, linear regression. Nowadays, supervised machine learning enables the modelling of...
View ArticleA Masked language model for multi-source EHR trajectories contextual...
Using electronic health records data and machine learning to guide future decisions needs to address challenges, including 1) long/short-term dependencies and 2) interactions between diseases and...
View ArticleSign Rank Limitations for Attention-Based Graph Decoders
Inner product-based decoders are among the most influential frameworks used to extract meaningful data from latent embeddings. However, such decoders have shown limitations in representation capacity...
View ArticleUsing remotely sensed data for air pollution assessment
Air pollution constitutes a global problem of paramount importance that affects not only human health, but also the environment. The existence of spatial and temporal data regarding the concentrations...
View Article