Generative AI for high-quality natural dialogues, a new language for controlling LLMs by Microsoft, a generative AI-based multimodal wearable device, no-code automation + AI, and more
Greetings and welcome to this week's AI Brews - your thoughtfully curated guide to AI products, learning resources and a concise roundup of the week's impactful news. Our goal? To provide a balanced selection in the rapidly evolving AI landscape, keeping you well-informed without the information overload. We value your feedback - don't hesitate to reply to this email with suggestions on how we can make this better for you. Thanks!
In today’s issue:
AI Pulse: News, Insights and Social Spotlight of the Week
AI Toolbox: Product Picks of the Week
AI Skillset: Learn & Build
🗞️ AI Pulse: News, Insights and Social Spotlight of the Week
🔥 News & Insights
Google presents SoundStorm - a new model for efficient audio generation. It can generate highly realistic dialogues via transcript annotations and short voice prompts. See video below or more in examples [Paper].
Microsoft releases a new language for controlling large language models: ‘Guidance’. Guidance enables you to control modern language models more effectively and efficiently than traditional prompting or chaining [Details].
Zapier launched two new AI beta features for their no-code automation platform:
Create a Zap using plain English: Simply describe what you want to automate using natural language.
Code with AI: Describe in natural language what you'd like to do in your ‘Code step’, and AI will generate the code [Details | Beta Access].
Stability AI released StableStudio - the open-source variant of DreamStudio, their text-to-image app, with plans to create bounties for new features [Details | GitHub].
Project Ring: A generative AI-based wearable device with a camera and microphone that can chat with you about what it sees. Powered by OpenAI Whisper (voice-to-text), Replicate (image-to-text), ChatGPT (text-to-text), and ElevenLabs (text-to-voice). The entire code - Raspberry Pi Python script, cloud application, HTML webpage, and Android app - was written by GPT-4! [Details | YouTube Link]
Meta shares plans for their next generation of AI infrastructure: a custom silicon chip (MTIA - Meta Training and Inference Accelerator) for running AI models, a new AI-optimized data center design and the second phase of their 16,000 GPU supercomputer for AI research [Details].
Apple shares upcoming AI-based features for cognitive, speech, and vision accessibility along with voice cloning. A notable feature is 'Point and Speak' in the stock Magnifier app, which uses the camera, LiDAR Scanner, and on-device machine learning to read aloud button labels on appliances like microwaves as users move their finger across the keypad [Details].
OpenAI has introduced the ChatGPT app for iOS, which offers voice input and is initially available to customers in the US [Link].
Cloudflare introduced Constellation: a new feature to run fast, low-latency inference tasks using pre-trained machine learning models natively with Cloudflare Workers scripts [Details].
Glide, the no-code tool for building custom apps, now includes integration with OpenAI in Glide apps [Details | Guide].
Google’s Colab will soon have AI coding features like code completions, natural language to code generation and a code-assisting chatbot. Colab will use Codey, a family of code models built on PaLM 2, which was announced at I/O last week [Details].
Google Cloud launched two new AI-powered solutions to help biotech and pharmaceutical companies accelerate drug discovery and advance precision medicine [Details].
Anthropic announced a partnership with Zoom. Zoom will use Claude, Anthropic’s AI chatbot, to build customer-facing AI products [Details].
Hippocratic AI has built a safety-focused large language model for healthcare to assist with tasks such as explaining billing, providing dietary and medication advice, answering surgery-related queries, patient onboarding etc. [Details].
OpenAI is rolling out ChatGPT plugins and the web browsing feature to all paid users. Check and enable them via ‘Beta features’ in the ChatGPT ‘Settings’ [Details].
🔦 Social Spotlight
AI notes for a 30-min psychiatry therapy session [Link].
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold. Demo in the link [Link].
Research on detection of AI-generated text [Link].
PrivateGPT, a custom knowledge chatbot that can be used offline [Link].
🔍 🛠️ AI Toolbox: Product Picks of the Week
Luna
Luna is a sales tool for lead generation and email outreach. It uses AI to suggest high-quality leads daily, tailoring its suggestions based on user feedback. Luna can also generate personalized emails for each prospect by analyzing information from the prospect's website and social profiles.
Solvemigo
A Telegram bot that provides a convenient, on-the-go interface to ChatGPT, DALL-E and Whisper, enabling users to ask questions and generate images using both text and voice (supports 60 languages).
ReRoom AI and ReRender AI
ReRoom AI and ReRender AI, both developed by the same company, are AI tools that generate photorealistic renders in a variety of styles from provided images. ReRoom AI caters to interior designers, while ReRender AI is for architects. Their functionality is similar: upload an initial image, which could be from an AutoCAD or Blender project or even a hand-drawn sketch of a room or building, select the desired style, and the tool creates a photorealistic render.
📕 AI Skillset: Learn & Build
Zapier CEO Wade Foster explains, in this open letter, how customers are using AI and Zapier to transform their workflows without coding.
Build a No-Code chat-with-PDF LangChain app using Flowise and Bubble and add a chat widget to any website:
A tutorial on Generative AI by Google Cloud: What is Generative AI, common applications, model types and how to use it.
LangChain 101 - A free LangChain video course with Replit projects (3/6 parts published so far):
Thank you for reading AI Brews! If you have any thoughts, questions or just want to say hello, please don't hesitate to hit reply.
Mariam