MAY
MAY 2023 Ai/Tech updates:
1. Nvidia released a 2b param model trained on 1.1T Tokens
2. Brain activity decoder can reveal stories in people’s minds
3. ArK: Augmented Reality with Knowledge Interactive Emergent Ability
4. GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
5. Inflection AI, Startup From Ex-DeepMind Leaders, Launches Pi — A Chattier Chatbot
6. Generalizing Dataset Distillation via Deep Generative Prior
7. Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation
8. Introducing LLaVA Lightning: Train a lite, multimodal GPT-4 with just $40 in 3 hours
9. Unlimiformer: Long-Range Transformers with Unlimited Length Input
10. Learning Physically Simulated Tennis Skills from Broadcast Videos
11. 7B OpenLLaMA model that has been trained with 200 billion tokens on the RedPajama dataset
12. AG3D: Learning to Generate 3D Avatars from 2D Image Collections
14. TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis
16. CLIP ViT-L/14 model with 79.2% zero-shot accuracy on ImageNet
17. OpenAI's Shap-E: Generating Conditional 3D Implicit Functions
19. MaMMUT: A simple vision-encoder text-decoder architecture for multimodal tasks
20. AutoML-GPT: Automatic Machine Learning with GPT
21. Nvidia Real-Time Neural Appearance Models
22. Personalize Segment Anything Model with One Shot
26. Composite Motion Learning with Task Control
27. Multi-Space Neural Radiance Fields
28. Locally Attentional SDF Diffusion for Controllable 3D Shape Generation
31. Introducing LeMUR, short for Leveraging Large Language Models to Understand Recognized Speech
32. OpenAI GPT-4 to interpretability — automatically proposing explanations for GPT-2's 300k Neurons
33. TidyBot: Personalized Robot Assistance with Large Language Models
34. FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
35. MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
38. Bard available in over 180 countries and territories including India and upgraded to PaLM 2
39. Google introduces PaLM 2 and it is coming to more than 25 products of Google
40. Google's Generative AI is coming to search
42. Synthesia research releases HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion
44. EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention
45. Assisted Generation: a new direction toward low-latency text generation (3x faster)
46. Artificial intelligence identifies anti-aging drug candidates targeting 'zombie' cells
49. MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers
50. HACK: Learning a Parametric Head and Neck Model for High-fidelity Animation
51. Consensus and subjectivity of skin tone annotation for ML fairness
52. Microsoft's TinyStories: How Small Can Language Models Be and Still Speak Coherent English
53. Google: Using reinforcement learning for dynamic planning in open-ended conversations
54. Multiple fully Tesla-made Bots now walking around & learning about the real world
55. Introducing Phoenix: a revolutionary humanoid general-purpose robot designed for work
57. Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback
58. FitMe: Deep Photorealistic 3D Morphable Model Avatars
59. Understanding 3D Object Interaction from a Single Image
60. Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation
61. AutoRecon: Automated 3D Object Discovery and Reconstruction
62. Dr. LLaMA: Improving Small Language Models in Domain-Specific QA via Generative Data Augmentation
64. “slick” RLHF-alternative without RL
65. LDM3D: Latent Diffusion Model for 3D (text to 360 image)
66. Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold
68. LIMA: Less Is More for Alignment
70. Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity
71. Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields
72. RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture
73. Meta's MMS: Massively Multilingual Speech - Can do speech2text and text speech in 1100 languages
76. Adobe just added their first Generative AI tool to Photoshop
77. Intel Announces Aurora genAI, Generative AI Model With 1 Trillion Parameters
78. Introducing Microsoft Fabric: Data analytics for the era of AI
79. Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
81. QLoRA: Efficient Finetuning of Quantized LLMs
82. Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks
84. SiamMAE: Siamese Masked Autoencoders for self-supervised representation learning from videos
85. Microsoft just announced Power Virtual Agent, a new generative actions engine in its chatbot builder
87. Stable Diffusion “Reimagine XL” model
88. Alexandria, an open-source initiative to embed the internet, starting with Arxiv
89. Neuralink recieved its first Human clinical trial approval from FDA
91. ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation
92. Voyager: An Open-Ended Embodied Agent with Large Language Models (LLM playing Minecraft)
93. state-of-the-art fMRI-to-image approach that retrieves and reconstructs images from brain activity
94. A Neural Space-Time Representation for Text-to-Image Personalization (similar to Dreambooth)
95. ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing
96. Think Before You Act: Decision Transformers with Internal Working Memory
97. PandaGPT: One Model To Instruction-Follow Them All
98. Break-A-Scene: Extracting Multiple Concepts from a Single Image
99. Photoswap: Personalized Subject Swapping in Images
100. Generating Images with Multimodal Language Models
101. Nvidia showcases real-time AI conversation in a game using voice
102. AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation
103. StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
106. Large sequence models for software development activities
107. Japan Goes All In: Copyright Doesn’t Apply To AI Training
Comments
Post a Comment