Asif Razzaq - Mochiai.blog

A Coding Implementation to Train Safety-Critical Reinforcement Learning Agents Offline Using Conservative Q-Learning with d3rlpy and Fixed Historical Data

By Asif Razzaq
February 4, 2026

In this tutorial, we build a safety-critical reinforcement learning pipeline that learns entirely from fixed, offline data rather than live exploration. We design a…

How to Build Multi-Layered LLM Safety Filters to Defend Against Adaptive, Paraphrased, and Adversarial Prompt Attacks

By Asif Razzaq
February 3, 2026

In this tutorial, we build a robust, multi-layered safety filter designed to defend large language models against adaptive and paraphrased attacks. We combine semantic…

Robbyant Open Sources LingBot World: a Real Time World Model for Interactive Simulation and Embodied AI

By Asif Razzaq
January 31, 2026

Robbyant, the embodied AI unit inside Ant Group, has open-sourced LingBot-World, a large-scale world model that turns video generation into an interactive…

AI2 Releases SERA, Soft Verified Coding Agents Built with Supervised Training Only for Practical Repository Level Automation Workflows

By Asif Razzaq
January 30, 2026

Allen Institute for AI (AI2) researchers introduce SERA (Soft Verified Efficient Repository Agents), a coding agent family that aims to match much larger…

Beyond the Chatbox: Generative UI, AG-UI, and the Stack Behind Agent-Driven Interfaces

By Asif Razzaq
January 29, 2026

Most AI applications still showcase the model as a chat box. That interface is simple, but it hides what agents are actually doing, such…

How a Haystack-Powered Multi-Agent System Detects Incidents, Investigates Metrics and Logs, and Produces Production-Grade Incident Reviews End-to-End

By Asif Razzaq
January 27, 2026

@tool
def sql_investigate(query: str) -> dict:
    try:
        df = con.execute(query).df()
        head = df.head(30)
        return {
            "rows": int(len(df)),
            "columns": list(df.columns),
            "preview": head.to_dict(orient="records")
        }
    except Exception…

How Machine Learning and Semantic Embeddings Reorder CVE Vulnerabilities Beyond Raw CVSS Scores

By Asif Razzaq
January 24, 2026

def visualize_results(df, priority_scores, feature_importance):
    fig, axes = plt.subplots(2, 3, figsize=(18, 10))
    fig.suptitle('Vulnerability Scanner - ML Analysis Dashboard', fontsize=16, fontweight='bold')
    axes[0, 0].hist(priority_scores, bins=30, color='crimson', alpha=0.7,…

Qwen Researchers Release Qwen3-TTS: an Open Multilingual TTS Suite with Real-Time Latency and Fine-Grained Voice Control

By Asif Razzaq
January 23, 2026

Alibaba Cloud’s Qwen team has open-sourced Qwen3-TTS, a family of multilingual text-to-speech models that target three core tasks in one stack: voice clone, voice…

FlashLabs Researchers Release Chroma 1.0: A 4B Real Time Speech Dialogue Model With Personalized Voice Cloning

By Asif Razzaq
January 22, 2026

Chroma 1.0 is a real-time speech-to-speech dialogue model that takes audio as input and returns audio as output while preserving the…

How to Design a Fully Streaming Voice Agent with End-to-End Latency Budgets, Incremental ASR, LLM Streaming, and Real-Time TTS

By Asif Razzaq
January 20, 2026

In this tutorial, we build an end-to-end streaming voice agent that mirrors how modern low-latency conversational systems operate in real time. We simulate the…
