Understanding Python Tests and Git Patches for OpenAI SWE Bench Verified DatasetOpenAI released a human-validated subset of SWE-bench dataset, named SWE-bench Verified, to help with evaluating LLMs’ ability to solve…Aug 20, 2024Aug 20, 2024
A Quick Comparison of Text-to-Image Models: Flux, Stable Diffusion 3, DALL·E 3, and KlingLast week, a new state-of-the-art text-to-image model called Flux was released by Black Forest Labs (the original creators of Stable…Aug 9, 2024Aug 9, 2024
Use Poetry for Python Package and Dependency ManagementI see more Python projects are using Poetry to manage packages and dependencies and want to learn what Poetry is and why it’s better than…Jul 14, 2024Jul 14, 2024
A Dataset for Teaching and Evaluating RAGAs a fan of Acquired (https://www.acquired.fm/), I recently published a dataset containing 200 Acquired Podcast Transcripts with metadata…Jun 15, 2024Jun 15, 2024
How to Setup a (Real) Self-Contained Python RepositoryHave you ever find some Python code on Github and could not easily run them locally due to issues like missing data, packages, versioning…May 1, 2024May 1, 2024
The “CAR” Problem of LLMsWhen I teach Retrieval-Augmented Generation (RAG), I defined the “CAR” (Credibility, Accuracy, and Recency) problem to outline the common…Apr 24, 2024Apr 24, 2024
Index Your Serverless MongoDBTL;DL: Implementing indexing on our Serverless MongoDB databases slashed our costs by a factor of at least 10.Apr 14, 2024Apr 14, 2024
Manage Long-running Python Tasks using JupyterHub on Remote Ubuntu ServerI recently re-configured my Ubuntu server with JupyterHub so that I can manage a long-running Python task with a large dataset, which…Aug 5, 2022Aug 5, 2022
CloseSea — the Smallest NFT MarketplaceI developed and deployed CloseSea: the smallest NFT marketplace :) based on the great tutorial by Nader Dabit.Jun 9, 2022Jun 9, 2022
A Practical Guide to Quasi-Experimental Methods (PSM and DID)A quasi-experiment is an empirical interventional study used to estimate the causal impact of an intervention on target population without…Jun 1, 2022Jun 1, 2022
A Minimalist End-to-End Machine Learning TutorialThis tutorial is developed based on teaching materials from my courses at the University of Delaware and my professional training workshops…Mar 3, 2022Mar 3, 2022
Get Test Ethers using SeleniumI need some test ethers for Rinkeby and found out that the faucet is not working (I also don’t like that you have to tweet to get the…Feb 24, 2022Feb 24, 2022
Developing Tailpages: A Jekyll Template based on Tailwind CSSTailpages Developer TutorialJan 12, 2022Jan 12, 2022
Introducing Tailpages (Tailwind + Github Pages)Setup free-hosted beautiful website without codingJan 11, 2022Jan 11, 2022
A Minimalist End-to-End Scrapy Tutorial (Part V)Systematic Web Scraping for BeginnersApr 13, 20201Apr 13, 20201