Harry Wang – Medium

Harry Wang

Understanding Python Tests and Git Patches for OpenAI SWE Bench Verified Dataset

OpenAI released a human-validated subset of SWE-bench dataset, named SWE-bench Verified, to help with evaluating LLMs’ ability to solve…

Aug 20, 2024

Understanding Python Tests and Git Patches for OpenAI SWE Bench Verified Dataset

Aug 20, 2024

A Quick Comparison of Text-to-Image Models: Flux, Stable Diffusion 3, DALL·E 3, and Kling

Last week, a new state-of-the-art text-to-image model called Flux was released by Black Forest Labs (the original creators of Stable…

Aug 9, 2024

A Quick Comparison of Text-to-Image Models: Flux, Stable Diffusion 3, DALL·E 3, and Kling

Aug 9, 2024

Use Poetry for Python Package and Dependency Management

I see more Python projects are using Poetry to manage packages and dependencies and want to learn what Poetry is and why it’s better than…

Jul 14, 2024

Use Poetry for Python Package and Dependency Management

Jul 14, 2024

A Dataset for Teaching and Evaluating RAG

As a fan of Acquired (https://www.acquired.fm/), I recently published a dataset containing 200 Acquired Podcast Transcripts with metadata…

Jun 15, 2024

A Dataset for Teaching and Evaluating RAG

Jun 15, 2024

How to Setup a (Real) Self-Contained Python Repository

Have you ever find some Python code on Github and could not easily run them locally due to issues like missing data, packages, versioning…

May 1, 2024

How to Setup a (Real) Self-Contained Python Repository

May 1, 2024

The “CAR” Problem of LLMs

When I teach Retrieval-Augmented Generation (RAG), I defined the “CAR” (Credibility, Accuracy, and Recency) problem to outline the common…

Apr 24, 2024

The “CAR” Problem of LLMs

Apr 24, 2024

Index Your Serverless MongoDB

TL;DL: Implementing indexing on our Serverless MongoDB databases slashed our costs by a factor of at least 10.

Apr 14, 2024

Index Your Serverless MongoDB

Apr 14, 2024

The Tech Stack that Works for My (AI) Startups

Feb 26, 2024

The Tech Stack that Works for My (AI) Startups

Feb 26, 2024

Data Labeling using GPT APIs

Feb 1, 2024

Data Labeling using GPT APIs

Feb 1, 2024

Deploy Stable Diffusion for AI Image Generation

Aug 23, 2022

Deploy Stable Diffusion for AI Image Generation

Aug 23, 2022

Manage Long-running Python Tasks using JupyterHub on Remote Ubuntu Server

I recently re-configured my Ubuntu server with JupyterHub so that I can manage a long-running Python task with a large dataset, which…

Aug 5, 2022

Manage Long-running Python Tasks using JupyterHub on Remote Ubuntu Server

Aug 5, 2022

CloseSea — the Smallest NFT Marketplace

I developed and deployed CloseSea: the smallest NFT marketplace :) based on the great tutorial by Nader Dabit.

Jun 9, 2022

CloseSea — the Smallest NFT Marketplace

Jun 9, 2022

A Practical Guide to Quasi-Experimental Methods (PSM and DID)

A quasi-experiment is an empirical interventional study used to estimate the causal impact of an intervention on target population without…

Jun 1, 2022

A Practical Guide to Quasi-Experimental Methods (PSM and DID)

Jun 1, 2022

When Data Science Meets NFT

Apr 23, 2022

When Data Science Meets NFT

Apr 23, 2022

A Minimalist End-to-End Machine Learning Tutorial

This tutorial is developed based on teaching materials from my courses at the University of Delaware and my professional training workshops…

Mar 3, 2022

A Minimalist End-to-End Machine Learning Tutorial

Mar 3, 2022

Get Test Ethers using Selenium

I need some test ethers for Rinkeby and found out that the faucet is not working (I also don’t like that you have to tweet to get the…

Feb 24, 2022

Get Test Ethers using Selenium

Feb 24, 2022

Developing Tailpages: A Jekyll Template based on Tailwind CSS

Tailpages Developer Tutorial

Jan 12, 2022

Developing Tailpages: A Jekyll Template based on Tailwind CSS

Jan 12, 2022

Introducing Tailpages (Tailwind + Github Pages)

Setup free-hosted beautiful website without coding

Jan 11, 2022

Introducing Tailpages (Tailwind + Github Pages)

Jan 11, 2022

The Triangle of Knowledge

How Human-Machine Transfer Knowledge

Jul 23, 2020

The Triangle of Knowledge

Jul 23, 2020

A Minimalist End-to-End Scrapy Tutorial (Part V)

Systematic Web Scraping for Beginners

Apr 13, 2020

A Minimalist End-to-End Scrapy Tutorial (Part V)

Apr 13, 2020