Posts

Showing posts from May, 2023

Japan Goes All In: Copyright Doesn’t Apply To AI Training #1148

Image
Japan Goes All In: Copyright Doesn’t Apply To AI Training #1148 Source  

Large sequence models for software development activities #1147

Image
Large sequence models for software development activities #1147 Source

UAE’s Falcon 40B, World’s Top-Ranked AI Model from Technology Innovation Institute is Now Royalty-free #1146

Image
UAE’s Falcon 40B, World’s Top-Ranked AI Model from Technology Innovation Institute is Now Royalty-free #1146 Source

OpenAI: We trained an AI using process supervision — rewarding the thought process rather than the outcome — to achieve new state-of-art in mathematical reasoning #1145

Image
OpenAI: We trained an AI using process supervision — rewarding the thought process rather than the outcome — to achieve new state-of-art in mathematical reasoning #1145 Source Tweet

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation #1144

Image
StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation #1144 Source

AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation #1143

Image
Source

A cool AR/VR concept for the future #1142

Image
A cool AR/VR concept for the future #1142 Source Source

Many AI researchers and other figures sign AI statement risk #1141

Image
Many AI researchers and other figures sign AI statement risk #1141 Source  

Nvidia showcases real-time AI conversation in a game using voice #1140

Image
Nvidia showcases real-time AI conversation in a game using voice #1140 Source  

Generating Images with Multimodal Language Models #1139

Image
Generating Images with Multimodal Language Models #1139 Source  

Photoswap: Personalized Subject Swapping in Images #1138

Image
Photoswap: Personalized Subject Swapping in Images #1138 Source

Break-A-Scene: Extracting Multiple Concepts from a Single Image #1137

Image
Break-A-Scene: Extracting Multiple Concepts from a Single Image #1137 Source

PandaGPT: One Model To Instruction-Follow Them All #1136

Image
PandaGPT: One Model To Instruction-Follow Them All #1136 Source

Think Before You Act: Decision Transformers with Internal Working Memory #1135

Image
Think Before You Act: Decision Transformers with Internal Working Memory #1135 Source

ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing #1134

Image
ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing #1134 Source

A Neural Space-Time Representation for Text-to-Image Personalization (similar to Dreambooth) #1133

Image
A Neural Space-Time Representation for Text-to-Image Personalization (similar to Dreambooth) #1133 Source

state-of-the-art fMRI-to-image approach that retrieves and reconstructs images from brain activity #1132

Image
state-of-the-art fMRI-to-image approach that retrieves and reconstructs images from brain activity #1132 Source

Voyager: An Open-Ended Embodied Agent with Large Language Models (LLM playing Minecraft) #1131

Image
Voyager: An Open-Ended Embodied Agent with Large Language Models (LLM playing Minecraft) #1131 Source

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation #1130

Image
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation #1130 Source

New open-source LLMs called Falcon, which comes into size 7B trained on 1.5T tokens and 40B trained on 1T Tokens #1129

Image
New open-source LLMs called Falcon, which comes into size 7B trained on 1.5T tokens and 40B trained on 1T Tokens #1129 Source

Neuralink recieved its first Human clinical trial approval from FDA #1128

Image
Neuralink recieved its first Human clinical trial approval from FDA #1128 Source  

Alexandria, an open-source initiative to embed the internet, starting with Arxiv #1127

Image
Alexandria, an open-source initiative to embed the internet, starting with Arxiv #1127 Source

Stable Diffusion “Reimagine XL” model #1126

Image
Stable Diffusion “Reimagine XL” model #1126 Source  

Bard will include images in responses for relevant prompts, including when you specifically ask for images #1125

Image
Bard will include images in responses for relevant prompts, including when you specifically ask for images #1125 Source

Microsoft just announced Power Virtual Agent, a new generative actions engine in its chatbot builder #1124

Image
Microsoft just announced Power Virtual Agent, a new generative actions engine in its chatbot builder #1124 Source

SiamMAE: Siamese Masked Autoencoders for self-supervised representation learning from videos #1123

Image
SiamMAE: Siamese Masked Autoencoders for self-supervised representation learning from videos #1123 Source

Introducing #ClinicalCamel 🐪: an innovative open-source conversational #LLM engineered specifically for healthcare #1122

Image
Introducing #ClinicalCamel 🐪: an innovative open-source conversational #LLM engineered specifically for healthcare #1122 Source  

Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks #1121

Image
Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks #1121 Source  

QLoRA: Efficient Finetuning of Quantized LLMs #1120

Image
QLoRA: Efficient Finetuning of Quantized LLMs #1120 Source

Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach #1119

Image
Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach #1119 Source  

Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models #1118

Image
Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models #1118 Source  

Introducing Microsoft Fabric: Data analytics for the era of AI #1117

Image
Introducing Microsoft Fabric: Data analytics for the era of AI #1117 Source

Intel Announces Aurora genAI, Generative AI Model With 1 Trillion Parameters #1116

Image
Intel Announces Aurora genAI, Generative AI Model With 1 Trillion Parameters #1116 Source  

Adobe just added their first Generative AI tool to Photoshop #1115

Image
Adobe just added their first Generative AI tool to Photoshop #1115 Source

Google introduces Product Studio, a tool that lets merchants create product imagery using generative AI #1114

Image
Google introduces Product Studio, a tool that lets merchants create product imagery using generative AI #1114 Source  

Announcing, Windows Copilot. Integrating the power of Bing Chat across all of Windows and all your apps #1113

Image
Announcing, Windows Copilot. Integrating the power of Bing Chat across all of Windows and all your apps #1113 Source  

Meta's MMS: Massively Multilingual Speech - Can do speech2text and text speech in 1100 languages #1112

Image
Meta's MMS: Massively Multilingual Speech - Can do speech2text and text speech in 1100 languages #1112 Source  

OpenAl: Al systems will exceed expert skill level in most domains within the next 10 years! #1119

Image
OpenAl: Al systems will exceed expert skill level in most domains within the next 10 years! #1119 Source  

RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture #1118

Image
RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture #1118 Source  

Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields #1117

Image
Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields #1117 Source  

Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity #1116

Image
Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity #1116 Source  

Any-to-Any Generation via Composable Diffusion - capable of generating any combination of output modalities, such as language, image, video, or audio, from any combination of input modalities #1115

Image
Any-to-Any Generation via Composable Diffusion - capable of generating any combination of output modalities, such as language, image, video, or audio, from any combination of input modalities #1115 Source  

LIMA: Less Is More for Alignment #1114

Image
LIMA: Less Is More for Alignment #1114 Source  

BlockadeLabs announced a Sketch mode for its Skybox AI image generator, which creates environments based on the lines you draw and your text prompt #1113

Image
BlockadeLabs announced a Sketch mode for its Skybox AI image generator, which creates environments based on the lines you draw and your text prompt #1113 Source  

Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold #1112

Image
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold #1112 Source  

LDM3D: Latent Diffusion Model for 3D (text to 360 image) #1111

Image
LDM3D: Latent Diffusion Model for 3D (text to 360 image) #1111 Source  

“slick” RLHF-alternative without RL #1110

Image
“slick” RLHF-alternative without RL #1110 Source  

The next iteration of Perplexity has arrived: Copilot, your interactive AI search companion (powered by GPT-4) #1109

Image
The next iteration of Perplexity has arrived: Copilot, your interactive AI search companion (powered by GPT-4) #1109 Source  

Dr. LLaMA: Improving Small Language Models in Domain-Specific QA via Generative Data Augmentation #1108

Image
Dr. LLaMA: Improving Small Language Models in Domain-Specific QA via Generative Data Augmentation #1108 Source