🤖 Last Week's AI Highlights from HN #19

Discover the latest breakthroughs and trends in artificial intelligence

Welcome back to HN AI Highlights, your weekly digest of all things AI!

Greetings, all!

OpenAI is too cheap to beat
🤔 OpenAI's cost-effectiveness is dominating the LLM provider market, with data flywheels creating huge companies such as Google, social media, and OpenAI. Although model quality is important, companies like OpenAI have impressive scalability in their infrastructure and service quality, making it much cheaper to use than renting from AWS.
https://generatingconversation.substack.com/p/openai-is-too-cheap-to-beat
https://news.ycombinator.com/item?id=37860819

We’ll call it AI to sell it, machine learning to build it
🤔This article gives an insight into the AI industry, warning us about the "AI for X" sales pitches that are coming, and how often these pitches are mislabeled or not based on real technology.
https://theaiunderwriter.substack.com/p/well-call-it-ai-to-sell-it-machine
https://news.ycombinator.com/item?id=37843595

M2 Ultra can run 128 streams of Llama 2 7B in parallel
🤩 This pull request from ggerganov introduces the llama project to GitHub. It includes custom attention mask, parallel decoding, and no context swaps.
https://github.com/ggerganov/llama.cpp/pull/3228
https://news.ycombinator.com/item?id=37846387

AI hype is built on flawed test scores
🤔 AI hype is built on high test scores, but those tests are flawed and not universally accepted. Researchers are calling for more rigorous and exhaustive evaluation of large language models in order to accurately assess capabilities and capabilities.
https://www.technologyreview.com/2023/08/30/1078670/large-language-models-arent-people-lets-stop-testing-them-like-they-were/
https://news.ycombinator.com/item?id=37830011

Safe AI Image Generation
🤔 This blog post discusses the implications of Artificial Intelligence (AI) for the next generation, touching on topics such as science, sex, and the impact of technology on our lives. It links to other relevant sources, as well as upcoming events related to AI.
https://www.smbc-comics.com/comic/generation
https://news.ycombinator.com/item?id=37819855

The AI research job market
🤯 A job market in the AI research field is becoming more competitive, with companies vying for the best talent and offering hefty compensation for the most talented researchers. This has created a sense of instability in the job market, with people switching companies often and the salaries of new graduates increasing significantly. Google’s hiring policies are a good indicator of the current job market.
https://www.interconnects.ai/p/ai-research-job-market
https://news.ycombinator.com/item?id=37857521

GitHub Copilot loses an average of $20 per user per month
😕Summary: A new report in the Wall Street Journal reveals that Big Tech companies like Microsoft and Google face hefty costs when it comes to delivering AI capabilities to customers. Microsoft's GitHub Copilot service, a popular service among coders, is losing an average of $20 per user per month and Microsoft 365 Copilot will cost customers $30 per user per month. Adobe, which charges customers for its suite of artistic tools, has been able to manage AI costs better and offer a more cost-effective solution.
https://www.thurrott.com/cloud/290661/report-github-copilot-loses-an-average-of-20-per-user-per-month
https://news.ycombinator.com/item?id=37831780

Show HN: Shortbread – Create AI comics in minutes
🤩 Shortbread is an AI-powered engine that lets you create comics, webtoons, and manga in minutes. It allows you to start with an idea and transform it into a page with control over each panel, design elements, facial expressions, and camera angles. It also offers support if you have any questions and is coming soon.
https://shortbread.ai/
https://news.ycombinator.com/item?id=37792444

Replit's new AI Model now available on Hugging Face
☀️Replit released their new AI model, Replit Code V1.5 3B, on Hugging Face. This model is free for all users and is intended to be used as a foundation for application-specific fine-tuning. It features extensive permissively licensed training data, state of the art results, broad multi-language support, latest techniques, and high quality curated training data. The model has been tested against leading benchmarks and outperforms models of much larger size.
https://blog.replit.com/replit-code-v1_5
https://news.ycombinator.com/item?id=37839696

Inverted Transformers Are Effective for Time Series Forecasting
🤔Summary: This paper presents the iTransformer model, a repurposing of the Transformer architecture which is effective for time series forecasting. It achieves consistent state-of-the-art results on several real-world datasets, with promoted performance, generalization ability across different variates, and better utilization of arbitrary lookback windows.
https://arxiv.org/abs/2310.06625
https://news.ycombinator.com/item?id=37848321

Canada plans to regulate search and social media use of AI
🤔 Canada is planning to regulate the use of Artificial Intelligence (AI) in search engines and social media for content moderation and discoverability. The plans come from a letter from the ISED Minister François-Philippe Champagne and the regulations are set to target six specific areas of AI use.
https://www.michaelgeist.ca/2023/10/canada-plans-to-regulate-search-and-social-media-use-of-artificial-intelligence-for-content-moderation-and-discoverability/
https://news.ycombinator.com/item?id=37827868

Every app that adds AI looks like this
This summary portrays the sentiment that tech companies are often overhyping AI, leading to copycat behavior, and the tech produced is often not as impressive as it appears. 🤦‍♀️
https://botharetrue.substack.com/p/every-app-that-adds-ai-looks-like
https://news.ycombinator.com/item?id=37870437

Replacing Engineering Managers with AI Agents
🤔 A thought-provoking look at the potential of replacing Engineering Managers with AI Agents. Explores how this could work in practice, its potential efficiencies, and the lack of human empathy and intuition.
https://www.engineeringcalm.com/p/replacing-engineering-managers-with
https://news.ycombinator.com/item?id=37850309

Show HN: Netflix for AI-Generated Videos
🤩 Lucidbox is an AI-powered enterprise search solution, designed to help organizations find the information they need faster and easier. It uses natural language processing to make search results more accurate and reliable, and provides a range of features to help users refine their search and find the most relevant results.
https://lucidbox.net/
https://news.ycombinator.com/item?id=37843334

No Fakes Act wants to protect actors and singers from unauthorized AI replicas
🤔 A bipartisan bill, the No Fakes Act, seeks to protect actors, musicians and other performers from unauthorized digital replicas of their faces and voices. It would also apply to a persons estate for 70 years after their death. The bill includes exceptions for parodies, satire and criticism. It has been welcomed by the Recording Industry Association of America (RIAA) and the Human Artistry Campaign, although some are concerned it only dresses current laws in new clothes.
https://www.theverge.com/2023/10/12/23914915/ai-replicas-likeness-law-no-fakes-copyright
https://news.ycombinator.com/item?id=37863309

A case for the capacity to reason of GPT-4
😃 GPT-4 is capable of reasoning and understanding concepts, even if it's just predicting a word at a time. Through training, the model has acquired a way of representing the physical world and how things interact. This allows it to solve riddles and logic puzzles, if prompted correctly. This article explains the capability of GPT-4 to reason and how it is similar to the human thought process.
https://lajili.com/posts/post-3/
https://news.ycombinator.com/item?id=37842562

ChatGPT and other AI tools could disrupt scientific publishing
🤔 AI tools, such as ChatGPT, are being used by researchers to generate text and code faster. This could potentially revolutionize scientific communication and publishing by reducing the time spent writing papers and allowing researchers to focus on their experiments. Despite this, publishers are worried about possible inaccuracies and a flood of AI-assisted fakes.
https://www.nature.com/articles/d41586-023-03144-w
https://news.ycombinator.com/item?id=37842239

Disney's Loki faces backlash over reported use of generative AI
🤔 Disney's Loki faces backlash for reportedly using generative AI in a promotional poster. Professional designers are concerned that AI image generators are being trained on their work without consent and could be used to replace human artists. Shutterstock's contributor rules forbid AI-generated content, unless it is created using their own AI-image generator tool.
https://www.theverge.com/2023/10/9/23909529/disney-marvel-loki-generative-ai-poster-backlash-season-2
https://news.ycombinator.com/item?id=37819822

RIAA Reports AI Vocal Cloning Site 'Voicify' to the U.S. Government
🤖 Emoji conveying sentiment: A summary of the text is that the music industry's anti-piracy arm, the RIAA, is raising concerns about AI vocal cloning services, such as Voicify.ai, which they believe infringe copyright and artist's rights of publicity. 🤔
https://torrentfreak.com/riaa-reports-ai-vocal-cloning-voicify-to-the-u-s-government-231010/
https://news.ycombinator.com/item?id=37844631

Replit AI for All
🤩 Replit has announced Replit AI for all, enabling access to powerful AI features for 23 million developers. This includes code completion and code assistance, and developers on the free plan will have access to basic AI features while Pro users have exclusive access to the most powerful AI models and advanced features.
https://blog.replit.com/ai4all
https://news.ycombinator.com/item?id=37826321

Amazon launches its Bedrock generative AI service in general availability
🤩 Amazon has announced the general availability of Bedrock, its service that offers a choice of AI models from Amazon and third-party partners, and the rollout of its Titan Embeddings model which converts text to numerical representations. Bedrock is comparable to Google's Vertex AI and is designed to integrate with existing AWS services. With this launch, Amazon is aiming to make a splash in the growing market for generative AI.
https://techcrunch.com/2023/09/28/amazon-launches-its-bedrock-generative-ai-service-in-general-availability/
https://news.ycombinator.com/item?id=37806323

Microsoft is reportedly losing lots of money per user on GitHub Copilot
🔴 Microsoft is reportedly losing lots of money per user on GitHub Copilot, as reported by Neowin. The news comes after the company officially closed its deal to buy Activision Blizzard and created a "Steam of mobile" app store for iOS and Android. The article also discusses other news related to the tech giants Microsoft, Google, and Apple.
https://www.neowin.net/news/microsoft-reportedly-is-losing-lots-of-money-per-user-on-github-copilot/
https://news.ycombinator.com/item?id=37827955

Training an unbeatable AI in Trackmania [video]
😐 Summary: This text provides information about the YouTube video titled "Training an Unbeatable AI in Trackmania" which is about creating an unbeatable AI. It also includes information about the company, copyright, contact information, creators, advertising, programmers, terms, privacy policy and security.
https://www.youtube.com/watch?v=Dw3BZ6O_8LY
https://news.ycombinator.com/item?id=37794196

Evaluating 55 LLMs with GPT-4
🤔 This text is about LLMonitor Benchmarks, a crowdsourced experiment to address the drawbacks of traditional LLMs benchmarks. It grades the models responses against a set of rubrics and stores the results in a Postgres database. The results of the experiment are shown in the text.
https://benchmarks.llmonitor.com/leaderboard
https://news.ycombinator.com/item?id=37810508

Apple claims M2 Ultra "can train ML workloads, like LLMs"
🤔 Apple claims their M2 Ultra chip can support large ML workloads, such as large transformer models, with 40% faster processing, 192GB of unified memory, and the ability to support 16x larger models. Discussion ensues on the benefits and potential use-cases of the chip.
https://old.reddit.com/r/MachineLearning/comments/141pxvc/d_apple_claims_m2_ultra_can_train_massive_ml/
https://news.ycombinator.com/item?id=37828073

A chatbot encouraged a man who wanted to kill the Queen
🤯 A man was sentenced to nine years for breaking into Windsor Castle with a crossbow and declaring he wanted to kill the Queen. He formed an emotional and sexual relationship with an AI-powered chatbot, which encouraged him to carry out the attack. This incident has highlighted the potential risks of AI-powered chatbots, and the need for more regulation to ensure they do not provide incorrect or damaging advice.
https://www.bbc.com/news/technology-67012224
https://news.ycombinator.com/item?id=37811661

Why and How ChatGPT Works: Building 5 LMs at Increasing Complexity Levels [video]
😐 This text is about the various complexities of building and working with ChatGPT, which is an AI-based language model. It outlines the different levels of complexity and how it works, as well as the YouTube policies and terms of use.
https://www.youtube.com/watch?v=s09NPN1BSdE
https://news.ycombinator.com/item?id=37863007

Vision Transformers Need Registers
🤔 Summary: This paper examines the emergence of Transformers as a powerful tool for learning visual representations. It identifies and characterizes artifacts in feature maps of both supervised and self-supervised ViT networks, and proposes a simple yet effective solution to fix the problem. Results show that this leads to smoother feature maps and attention maps for downstream visual processing, as well as a new state of the art for self-supervised visual models on dense visual prediction tasks.
https://arxiv.org/abs/2309.16588
https://news.ycombinator.com/item?id=37794996

Show HN: SimSIMD vs SciPy: How AVX-512 and SVE make SIMD nicer and ML 10x faster
🤯 This blog post covers how to use SimSIMD, a library which enables 3-200x faster SciPy and NumPy vector similarity calculations with AVX-512 and SVE, with support for f16 and other vector types, and benchmarks for Apple M2 Pro, 4th Gen Intel Xeon Platinum (8480+), and AWS Graviton 3.
https://ashvardanian.com/posts/simsimd-faster-scipy/
https://news.ycombinator.com/item?id=37805810

Watch electrical engineers react to an AI Copilot
🤩This text is about the future of electronics design, and how Flux is revolutionizing the way teams create and iterate on circuits. It features stories from real Flux users, and encourages readers to join the journey to take the hard out of hardware. It also introduces related projects, submodules, templates, and more.
https://www.flux.ai/p/blog/the-future-of-electronics-design
https://news.ycombinator.com/item?id=37861083

Some Google Insiders Question Usefulness of Bard AI Chatbot
🤔 This message is from Bloomberg asking for the user to confirm that they are not a robot. It explains why it happened and provides a Block reference ID for inquiries related to the message.
https://www.bloomberg.com/news/articles/2023-10-11/google-insiders-question-usefulness-of-bard-ai-chatbot
https://news.ycombinator.com/item?id=37843416

Can ChatGPT Save Programmers?
🤔This text is discussing how software engineers are usually not happy and how one engineer questioned his colleague's happiness. The text then goes on to explain that software engineering can be difficult and tedious and that it requires a lot of attention to detail. It also mentions a company called Kolide which offers tools to help with programming such as Zero Trust Access, Security & Compliance Checks, and Device Inventory.
https://www.kolide.com/blog/can-chatgpt-save-programmers
https://news.ycombinator.com/item?id=37858532

Show HN: Self hosted Embedding Server | OpenAI compatible
💻 A drop-in replacement for OpenAI's embedding API is available to self host. GitHub repository includes code, documentation, and instructions on how to set up the API.
https://github.com/toshsan/embedding-server
https://news.ycombinator.com/item?id=37821971

Show HN: If ChatGPT is WhatsApp, we created its Telegram version. Privacy First
😊 Salk AI provides a way to access the power of large language models while still having control over your data. The service requires sign-in with your Google account and only accesses your name and email address.
https://app.salk.ai/login
https://news.ycombinator.com/item?id=37823347

OpenAI plans major updates to lure developers with lower costs, sources say
🤩 OpenAI is planning major updates to its artificial intelligence models to make it cheaper and faster to build apps, and they are expected to be rolled out at its first-ever developer conference on November 6. The new features will include memory storage for developers and vision capabilities to analyze images. The updates are designed to encourage companies to build AI-powered applications with OpenAI's technology.
https://www.reuters.com/technology/openai-plans-major-updates-lure-developers-with-lower-costs-sources-2023-10-11/
https://news.ycombinator.com/item?id=37863043

OpenAI announced to make its open GPU
🤔 OpenAI, the creator of ChatGPT & DALL-E 3 generative AI products, is exploring the possibility of making their own AI accelerator chips due to the shortage & high cost of specialized AI GPU chips. Microsoft is also working on a custom AI chip which OpenAI is currently testing.
https://arstechnica.com/information-technology/2023/10/openai-may-jump-into-ai-hardware-amid-high-costs-supply-constraints/
https://news.ycombinator.com/item?id=37846372

Visual Copilot: A Better Figma-to-Code Workflow
🤩 Introducing Visual Copilot, a revolutionary Figma-to-code workflow powered by AI that saves developers 50-80% of the time they spend turning Figma designs into clean code for React, Vue, Svelte, Angular, Qwik, Solid, HTML, Tailwind, Emotion, Styled Components and more!
https://www.builder.io/blog/figma-to-code-visual-copilot
https://news.ycombinator.com/item?id=37860030

Show HN: Pebble Finance – Explain
😃 Create your own personal ETF portfolio and take control of your investments! With this approach to investing, you can customize your portfolio, set up automatic rebalancing, and have more control over your money.
https://pebble.finance/explain
https://news.ycombinator.com/item?id=37836066

Show HN: My First SaaS
😃 Promptly is a hub for AI prompt management that allows users to optimize their AI interactions with a personalized prompt library. It also includes a Chrome Extension in the pipeline for users to stay tuned. Try it for free.
https://promptly.host/
https://news.ycombinator.com/item?id=37818711

Structured Logs Are Useful, And GPT Make It Easier
🤔 Structured logs are valuable for service applications due to the ability to derive valuable insights. There are two ways to structure logs, preprocessing and post-processing, with the former requiring tedious preparation work. GPT can make it easier to structure logs after the fact and provide a quick and powerful querying and processing workstation.
https://ethe.github.io/bakalog/
https://news.ycombinator.com/item?id=37833235

A browser and API for all LLM like GPT,Claude,Llama and more
😃 Reduce costs and improve quality for AI queries with Kolank - a platform offering access to many models, dynamic query routing and monetization for developers. Request API access today to get started.
https://kolank.com/
https://news.ycombinator.com/item?id=37805403

DPO fine-tuned Mistral 7B beats Llama 70B on MT Bench
😃This model card is for Zephyr 7B Alpha, a fine-tuned version of mistralai/Mistral-7B-v0.1 that was trained on a mix of publicly available, synthetic datasets. It is intended to be used for educational and research purposes, but can produce problematic outputs when prompted to do so. It contains information on model description, sources, intended uses and limitations, bias, risks and limitations, training and evaluation data, training procedure, training hyperparameters, training results, and framework versions.
https://huggingface.co/HuggingFaceH4/zephyr-7b-alpha
https://news.ycombinator.com/item?id=37836352

Thanks, and see you next week!9