Encoder/Decoder Transformer Model

Google Gemma 4 12B Brings Multimodal AI to 16GB Laptops, Free Under Apache 2.0

Google Gemma 4 12B, released June 3, is an open-weight multimodal model that processes text, images, audio, and video in a ...

IEEE

Seismic Facies Segmentation via a Segformer-Based Specific Encoder–Decoder–Hypercolumns Scheme

Abstract: Seismic facies classification plays an important role in oil and gas reservoir interpretation. In the past few years, convolution neural network (CNN)-based models have been widely used in ...

EDN

MLPerf and the rise of latency-aware LLM benchmarking

Here is a sneak peek at the evolution of the MLPerf benchmark and how generative AI forced a radical shift in AI hardware ...

IEEE

Remote-Sensing Image Captioning Based on Multilayer Aggregated Transformer

Abstract: Remote-sensing image (RSI) captioning aims to automatically generate sentences describing the content of RSIs. The multiscale information of RSIs contains attributes and complex ...

Memeburn

Google's Gemma 4 12B Runs AI Natively on Your Laptop — No Cloud Needed

Google's Gemma 4 12B brings multimodal AI — audio, video, and text — to a standard 16GB laptop in 2026. No cloud required. Here's what it does and why it matters.

GitHub

MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers

This repository contains the implementation for the paper: MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers by Yawar Siddiqui, Antonio Alliegro, Alexey Artemov, Tatiana Tommasi, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results