JUNE

 JUNE updates. Focusing on quality over quantity now.

1. Meta Quest 3 officially announced by Mark Zuckerberg

2. Nvidia: Introducing Neuralangelo - A new AI model to turn 2D video from any device -- cell phone to drone capture -- into 3D structures with intricate details

3. AVFormer: Injecting vision into frozen speech models for zero-shot AV-ASR

4. SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds

5. Bytes Are All You Need: Transformers Operating Directly On File Bytes

6. Diffusion Self-Guidance for Controllable Image Generation

7. LLaMA-Adapter Multimodal

8. Humans in 4D: Reconstructing and Tracking Humans with Transformers

9. StyleDrop: generate new images that follow a style of the user's choice, given only a single style reference image

10. Segment Anything in High Quality

11. Apple Vision Pro headset

12. Meta AI: Hiera is an extremely simple hierarchical vision transformer that's both more accurate than previous models and significantly faster at inference and during training

13. VideoComposer: Compositional Video Synthesis with Motion Controllability

14. Meet LTM-1: LLM with 5,000,000 prompt tokens (coming in future)

15. Text-to-3D Character From DAZ 3D

16. AlphaDev, an AI system using reinforcement learning to discover enhanced computer science algorithms

17. Stability AI launches Uncrop, a new outpainting tool powered by Stable Diffusion XL

18. Recognize Anything: A Strong Image Tagging Model

19. Tracking Everything Everywhere All at Once

20. Meta just released MusicGen, a simple and controllable model for music generation

21. OpenLLaMA 7B trained on 1T tokens

22. Google's Generative AI support on Vertex AI is now generally available (PaLM 2 and more)

23. Introducing SlimPajama-627B: the largest extensively deduplicated, multi-corpora, open-source dataset for training large language models

24. OpenAI's ChatGPT June 13 Update - New Model versions, 16k context length for GPT-3.5, Lower API cost & more!

25. Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

26. Augmenting Language Models with Long-Term Memory

27. Google: How AI makes virtual clothing try-on more realistic

28. Seeing the World through Your Eyes

29. Progressively Optimized Local Radiance Fields for Robust View Synthesis (Better NeRFs)

30. Meta AI: Introducing Voicebox, a new breakthrough generative speech system

31. OpenLLaMA 13B Released

32. DeepMind: Introducing RoboCat, a new AI model designed to operate multiple robots

33. MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing

34. Textbooks Are All You Need

35. Inflection-1, powering Pi - outperforming GPT-3.5, LLaMA and PaLM-540B

36. Stability AI - Stable Diffusion XL 0.9 (SDXL 0.9)

37. Fast Segment Anything

38. zeroscope_v2 - Open Source Text to Video Model

39. First drug discovered and designed with generative AI enters Phase II trials, with first patients dosed

40. Salesforce Introduces XGen-7B, a new 7B LLM trained on 8K seq. length for 1.5T tokens

41. Introducing Playground with Mixed Image Editing - AI Image Editor

42. New GPT-4 powered Microsoft Shopping tools arrive on the new Bing and Edge

43. Whole-brain connectome of the fruit fly, including ~130k annotated neurons and tens of millions of typed synapses

44. Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

45. CSM AI releases Any Image to 3D

46. One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization

47. DreamDiffusion: Generating High-Quality Images from Brain EEG Signals

48. OpenFlamingo V2

49. Pi chatbot now available in 70+ countries

