Ethan Nguyen

MacBook -> ThinkPad + Arch Linux

2024-07-30T00:00:00-07:00

I have been an Apple user (fanboy) for the last 8 years. Here is a slightly embarrassing list of Apple products I have owned:

MacBook Pro x2
iPad Pro x2
iPhone x2
Watch x2
AirPods x3
Magic Keyboard x2

I’ve really enjoyed my experience with Apple. I love how “everything just works” and the tight integration between software and hardware within Apple’s ecosystem. I can respond to text messages on any device and my photos are automatically synced to iCloud. However, I have become annoyed with how restrictive Apple’s ecosystem has become. As my hardware stack has aged, I have realized that Apple increasingly relies on planned obsolescence: devices are hard (and expensive) to repair, so users are pushed to buy a new device instead (see Louis Rossmann). For example, my 6-year-old MacBook Pro has started showing its age. On my previous laptops, I have been able to perform basic hardware maintenance (replacing thermal paste, cleaning the fans, etc.) and make hardware upgrades (SSD, RAM, etc.). However, with Apple, some of these tasks are now impossible. The practice of planned obsolescence is expensive and harmful to the environment. So, in the last year, I’ve begun looking for a different tech stack.

I use my computer to browse the web and build web apps, so I only need a basic device. Any laptop from the last 5 years should be able to handle these tasks. Here are a couple requirements I had:

15-inch screen
Quad-Core Processor
4-5 hours of battery life
Easy to repair (can upgrade SSD and RAM)

With these requirements, I bought a second-hand ThinkPad T580 from eBay for $130. A lot of companies lease ThinkPads and sell them after 4-5 years, so these laptops and their parts are widely available on the market for low prices. When the laptop arrived, it looked almost brand new and had fewer than 100 cycles on the battery. Also, the keyboard did not disappoint!

For the operating system, I wanted something lightweight and flexible enough to tailor the OS to my needs. Arch Linux was an ideal choice. I’ve kept Windows 10 dual booted just in case an application does not run on Linux.

In the last couple days, I’ve really enjoyed using Arch Linux! Arch basically only ships with the Linux kernel and a package manager called pacman. I love having the flexibility to add exactly what I need and nothing more. I am not beholden to some corporation. I can build and maintain my system however I want.

One unexpected benefit is that I feel more connected to my operating system and the FOSS world. I have found that working on Arch makes me more motivated to write software for fun. On macOS, everything felt too locked down, but on Arch Linux I feel free to read and modify source code with other programmers. For example, I have always wanted my laptop to stop charging at 80% to better protect its lithium-ion battery. On macOS, this required some pretty complex code to modify the SMC (feel free to read this battery management app - AlDente source code). However, Linux has a nice interface where you can write an integer between 0 and 100 to a file to set charge limits (see TLP).
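To make the "write an integer to a file" idea concrete, here is a minimal sketch in Python. It assumes the kernel exposes the stop-charging threshold at `/sys/class/power_supply/BAT0/charge_control_end_threshold`; the battery name (`BAT0`) and whether the file exists at all depend on the hardware and driver. The function names are my own and not part of TLP.

```python
from pathlib import Path

# Assumption: the stop-charging threshold lives at this sysfs path.
# The battery name (BAT0) may differ on other machines.
THRESHOLD_FILE = Path("/sys/class/power_supply/BAT0/charge_control_end_threshold")


def set_charge_limit(percent: int, threshold_file: Path = THRESHOLD_FILE) -> None:
    """Write a stop-charging threshold (0-100) to the given file."""
    if not 0 <= percent <= 100:
        raise ValueError(f"percent must be between 0 and 100, got {percent}")
    threshold_file.write_text(f"{percent}\n")


def get_charge_limit(threshold_file: Path = THRESHOLD_FILE) -> int:
    """Read the current stop-charging threshold back as an integer."""
    return int(threshold_file.read_text().strip())
```

Calling `set_charge_limit(80)` would normally need root, since the file lives in sysfs. In practice a tool like TLP handles this for you and reapplies the setting on boot.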

It’s early days for my transition to Arch Linux and I expect a lot to change. I am wary of whether apps are supported on this distro. I mainly use web apps and avoid proprietary software where I can. However, I have run into situations where Linux is not supported. For example, to take my AWS certification exam, I needed to be on a Windows or Mac system (Pearson VUE Requirements). To deal with these situations, I have kept Windows dual booted just in case.

If you have any recommendations for transitioning to Arch, feel free to email me. I’ll share on my blog anything useful I learn during my journey. I plan to use Arch as my daily driver and read “How Linux Works, 3rd Edition: What Every Superuser Should Know” by Brian Ward.

Model Building (not ml)

2024-07-19T00:00:00-07:00

“What is truth? How do you seek truth?”

These two questions capture the theme of the questions that plague my mind throughout the day as I move between roles in my day-to-day life. As an engineer, I often ask “how do I design this system to be reliable and scalable? why does this problem re-occur?”. As a foodie, “where can I find tasty food?”. As a human, “how can I live a good life? how can I be a good son/friend/co-worker?”.

To me, it seems that truth can be both objective and subjective. The truth is often objective in the sciences and engineering, and subjective in the humanities. However, engineering does have subjective truths as well. Engineering systems are tools built by humans, and every engineer has their subjective experience building and maintaining those systems.

I don’t think we can actually obtain truth. Similar to Plato’s Allegory of the Cave, we can only see the effects of our actions, not their causes. At best, we can build models of what we can see. These models are useful in making decisions. Without them, we would get decision paralysis. We just need to be careful to remember that these models are not the truth, and not be surprised when they fail.

I have two processes for model building:

Taking other people’s models.
Build your own: Observations -> Model -> Test if Model Aligns with Future Observations

Each model-building process has its own pros and cons.

Taking other people’s models.
- Pros: Faster and easier to take models.
- Cons: Other people’s models may not work for you. May be biased.
Build your own: Observations -> Model -> Test if Model Aligns with Future Observations
- Pros: Tailored for your unique situation and personality.
- Cons: Built on only one person’s experience. Takes a longer time than taking someone else’s model.

I tend to incorporate both processes in building my own models. When it comes to more objective truths, I usually take other people’s models and refine them based on my own experience. With more subjective truths, I build my own model but take into account other people’s models.

An important part of model building is getting more observations in quantity, quality, and diversity. While I use the word “observation”, you also need to act to create observations. Your model drives how you act and your actions create observations. So, depending on how you act, you will only get a subset of all possible observations in the world. For example, if you repeatedly run into a wall, you might conclude that it is impossible to move beyond that wall. However, if you climb, you may discover that you can scale and get past that wall.

Be cognizant of what models you have and how you have built them. You may discover new things by trying out other people’s models. Find different ways to build and live. It can be miserable yet fun and revolting yet tasty. The underlying truth is unknown and constantly shifting, so be careful about keeping your models static.

Book Summary & Reflection - The Anatomy of the Swipe: Making Money Move

2023-12-20T00:00:00-08:00

Ever since I got my first credit card, I have been interested in better understanding how card payments work. There are ~276 million card transactions handled every day [1]. How are these transactions handled? What is the business model of the incumbent businesses? What are the major pain points of the current payments system? How would you create a new credit card offering?

A fellow engineer at Capital One recommended I read “The Anatomy of the Swipe: Making Money Move” by Ahmed Siddiqui to better understand the payments industry. Here, I’ll summarize my key learnings and reflect on what it means for technologists in finance.

Overview

There are four parts required to facilitate a card payment:

A card (physical, virtual, token)
Merchant
Payment Network
Secure internet connection to transmit messages.

When a customer goes to pay at a merchant, a request to approve the transaction is sent in this basic (but not exhaustive) flow: Merchant -> Merchant Acquirer -> Acquirer Processor -> Card Network -> Issuer Processor -> Issuer.

Definition of Key Players:

Merchant - business selling the good or service. If they’re physical, they need a machine to read the card. If they’re virtual, they need a payment gateway. Examples of Merchants: Walmart, Costco, and Amazon.
Merchant Acquirer - Partners with Merchants and provides them the tools and facilities to accept and process card-based payments. Examples of Merchant Acquirer: Chase Paymentech and Global Payments.
Acquirer Processor - Merchant acquirer’s technology partner to connect with payment network. Usually, acquirer processor will have the hardware to connect to the payment network to request approval of transaction. Examples of Acquirer Processors: Redsys, Monext, Elavon.
- Some merchant acquirers have built this in-house or may rely on a third-party.
Payment Network (aka Card Scheme) - Provide infrastructure for card-based transactions. Sit between Acquirers and Issuers and pass messages back and forth to enable transaction. Payment networks also set the communication rules and standards. Example of Payment Network: Visa, Mastercard, American Express, Discover
Issuer Processor - Issuer’s technology partner to connect to the payment networks. This technology provider will usually have hardware in their data center and a fast network connection to the payment network to approve or decline transactions. Example of Issuer Processors: TSYS, Galileo, i2c.
- Some Issuers have built this in-house or may rely on a third-party.
- Note: Capital One has partnered with TSYS to help process their credit cards.
Issuer - an Issuer or Issuing Bank’s purpose is to underwrite the user by granting them access to a bank account and potentially access to credit facilities. Ex: JPMorgan Chase, Capital One, Citi, and Wells Fargo.

The issuer will then decide whether to approve or deny the transaction. If approved, the issuer will place a hold on the funds. Later, at the end of the day, the Merchant will confirm transactions and include tips, transaction reversals, and refunds (clearing). Finally, money is moved between customer and merchant (settlement).

Payment Ecosystem

There are two types of card networks:

Open Networks (Visa, Mastercard) - There are multiple Merchant Acquirers and Issuers. Card network makes money through fees.
- Pros: Good for distribution. Get brand into as many consumers and merchants as possible.
- Cons: Complex to coordinate players.
Closed Networks (American Express, Discover) - Take their own interchange, Acquirer, and Network Assessment fees. Can adjust fee based on Merchant size. Visa accounts for ~50% of all purchase volume. American Express accounts for 13% and Discover for 2%.
- Pros: Revenue per swipe is higher.
- Cons: Total swipe volume is lower since the network is not as large.
For debit cards, each Card Network has a secondary network brand for PIN Debit or Automated Teller Machine (ATM).
- PIN Debit Network. Visa - Interlink, Visa-Net Debit. Mastercard - Maestro. Discover - Pulse
- Durbin Amendment - every debit card must have a secondary unaffiliated network.
ATM Networks. ATM charges user a fee. Issuer of card is charged an Interchange fee by the ATM.
- ATM Networks - Visa - Plus. Mastercard - Cirrus. Discover - Pulse.
- “Free” ATM networks - MoneyPass and Allpoint. Only charges the issuing bank. No cost directly charged to customer.

Flow of Payments within the Payment Ecosystem

	Merchant	Acquiring Bank	Card Network	Issuing Bank	Customer
Payment	+				-
Acquirer Fee	-	+
Network Assessment Fee	-		+
Interchange	-			+
Rewards				-	+

Note: “Rewards” was added by me. What I find fascinating about this diagram is that every player in the ecosystem is incentivized to participate (i.e. every player has a “+”). I wonder - what are the costs of such a payment ecosystem?

Authorizations

Authorization - happens at moment of swipe, dip, or tap at payment terminal. Action places hold of funds on the cardholder’s account or may decline transaction.
Credit cards are often Dual-message Signature transactions - Authorization (message one) happens at the time of swipe. Clearing (message two) follows in bulk at the end of the night. Sometimes, the Clearing message differs from the Authorization, for example when a tip is included.
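The dual-message flow can be sketched as a toy issuer-side ledger. This is only an illustration of the idea (hold on authorization, post the final amount on clearing); the class and method names are hypothetical, and real issuer processors track far more state than this.

```python
from decimal import Decimal


class CardAccount:
    """Toy issuer-side ledger: a credit limit, a posted balance, and open holds."""

    def __init__(self, credit_limit):
        self.credit_limit = Decimal(credit_limit)
        self.balance = Decimal("0")
        self.holds = {}

    def available(self):
        """Credit left after posted charges and open holds."""
        return self.credit_limit - self.balance - sum(self.holds.values(), Decimal("0"))

    def authorize(self, auth_id, amount):
        """Message one: approve and place a hold, or decline."""
        amount = Decimal(amount)
        if amount > self.available():
            return False
        self.holds[auth_id] = amount
        return True

    def clear(self, auth_id, final_amount):
        """Message two: release the hold and post the final amount."""
        self.holds.pop(auth_id)
        self.balance += Decimal(final_amount)


# A $40.00 dinner is authorized at the table, then cleared at $48.00 with tip.
account = CardAccount("500.00")
approved = account.authorize("dinner", "40.00")
account.clear("dinner", "48.00")
```

The point of the sketch is that the hold and the posted amount are two separate records, which is why a pending charge can change once the tip is added.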

Clearing

Clearing - The term “Clearing” used primarily by Issuers. Also called “Capture” by Merchant Acquirers. Clearing happens at the end of the day. Merchant will include tips, transaction reversals, and returns. Merchant confirms transactions are valid and funds are ready to be settled.
Settlement - actual movement of money from cardholder’s bank account (Issuing Bank) to the Merchant’s bank account (Acquiring Bank). Typically happens via Fedwire.
Card Network does not send the full amount. Card Network will
- keep a percentage for itself as the Network Assessment Fee
- take a percentage and pass it on to the card Issuer as the Interchange Fee
Merchant Acquirer charges merchant an Acquirer fee too. This is charged at the end of the month.
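Here is a small worked example of how one payment gets divided between the players. The three rates are hypothetical, picked only to keep the arithmetic easy to follow, and the sketch lumps the Acquirer fee in with the per-transaction fees even though it is actually billed at the end of the month.

```python
from decimal import Decimal, ROUND_HALF_UP

CENT = Decimal("0.01")


def split_payment(amount, assessment_rate, interchange_rate, acquirer_rate):
    """Return who receives what from one card payment, rounded to cents."""
    amount = Decimal(amount)
    assessment = (amount * Decimal(assessment_rate)).quantize(CENT, rounding=ROUND_HALF_UP)
    interchange = (amount * Decimal(interchange_rate)).quantize(CENT, rounding=ROUND_HALF_UP)
    acquirer_fee = (amount * Decimal(acquirer_rate)).quantize(CENT, rounding=ROUND_HALF_UP)
    return {
        "card_network": assessment,      # Network Assessment Fee
        "issuing_bank": interchange,     # Interchange Fee
        "acquiring_bank": acquirer_fee,  # Acquirer fee
        "merchant": amount - assessment - interchange - acquirer_fee,
    }


# Hypothetical rates: 0.15% assessment, 1.8% interchange, 0.3% acquirer fee.
split = split_payment("100.00", "0.0015", "0.018", "0.003")
```

With these made-up rates, a $100.00 purchase leaves the merchant with $97.75, and the remaining $2.25 is shared between the card network, the issuing bank, and the acquiring bank.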

Chargeback

Chargeback - when a cardholder doesn’t recognize a charge on a card, they may request their money back through the Issuing Bank. Chargebacks may be used when goods and services have not been provided by the Merchant, but the Merchant refuses a refund.
- Chargebacks cost merchants $25-35 per transaction. Often, merchants will eat the cost of low-value transactions.
Merchants are encouraged to keep their chargeback rate below 1%. Otherwise, Card Network may remove the merchant.

To help fight fraud and reduce the number of chargebacks, there are a number of technologies in play:

EMV Chip Card - Originally stood for “Europay, Mastercard, Visa” which established the technical standard of encoding card data onto a secure chip on a card. These cards are “dipped” into card terminals. This is far more secure than data stored on a magnetic stripe.
- Merchants that have implemented this standard are not liable for fraud. The issuing bank must eat the cost of fraud here.
3D Secure - Standard for offering cardholders more security in online transactions. Involves the use of a one-time PIN or passcode.

Banks

Banks have 3 purposes - Issue cards. Serve as Acquiring Banks to merchants. Facilitate movement of real money.
- Only banks can issue credit cards. If you would like to issue a card and you are not a bank, then you must partner with a bank.
New challenger banks are disrupting the field. They partner with small banks not subject to Durbin Amendment and its limits on interchange. Neo-banks make money off deposits and debit card interchange.
- Note: Mobile has enabled these neo-banks to proliferate. Many consumers no longer care about the proximity of a physical bank branch when they can access the bank’s app in their pocket. Neobanks compete with traditional banks with various features such as 2-day early payday, better underwriting models for underbanked groups (ex: Karat credit card for content creators), and interest-free secured credit cards.

Taking Payments

Independent Sales Organization (ISO) - ISO is granted a license to sell Merchant acquiring services from a Merchant Acquirer
Payment Facilitator (PF or PayFac) - Layer on top of a Merchant Acquirer. Payment facilitators can onboard very quickly and offer out-of-the-box hardware and software to enable a merchant to take payment.
- Advantages: Fast setup. Fixed pricing. Managed Fraud.
- Drawback: Being a sub-merchant. Fixed pricing can be expensive.
How do payment facilitators make money? Revenue from Software or Hardware. Revenue from each transaction. Pay Merchant Acquirer the Acquirer fee. Pay card Issuer’s interchange. Pay Network network assessment fees. PF can aggregate transactions and negotiate low Acquirer fees.
Using a Merchant Acquirer
- Advantages: Being a Direct Merchant. Hardware and Software options. Interchange Plus Pricing. Can use flat fee or Interchange Plus. Faster Funds settlement.
- Disadvantages: More paperwork. Manage fraud directly.
How do Merchant Acquirers make money? Revenue from Hardware. Revenue from each transaction.
Payment Service Provider - Aggregator of payment methods. Allows website to get paid via debit card, mobile wallets, and financing schemes such as Buy Now, Pay Later.

Making Payments

Co-Brand Partner - typically a brand or company marketing the card.
Program Manager - manages the day-to-day operations of the card program including settlement, fraud management, and maintaining the relationship of the Issuing Bank, card manufacturer, card network, and cardholder.
- Makes money by getting portion of interchange
Issuer Processor - connection between card and network. Needs to parse an ISO 8583 message (standard card transmission) and respond in 3 seconds. Licenses a piece of hardware from the Network commonly referred to as a Mastercard Interface Processor (MIP) for Mastercard and a VisaNet Integrated Processing (VIP) for Visa.
- Also responsible for integration with co-brand. May provide alerts or ability to turn on/off card. Issuer Processors provides APIs and documentation.
- Makes money as a utility based on number of cards or on each transaction.
JIT Funding - Forward details from card swipe for approval.
Issuing Bank - works with Program manager to provide the settlement and bank accounts
- Makes money by possibly charging Program Manager fees for setting up bank accounts, performing compliance audits, and general oversight. Issuing Bank also sponsors BIN. Card networks only give BIN to banks.

Know Your Customer (KYC)

KYC - Practice in banking or finance used to attach identity to a user of a product
For a better user experience, you should progressively ask for more and more information from the customer on a need-to-know basis. This step is critical especially for underbanked customers.

Credit vs Debit Card

23% of millennials don’t carry credit cards (TD Bank’s Annual Consumer Spending Survey)
- Prefer debit cards for simplicity of seeing spending balance
- Note: I believe there’s an opportunity here to help millennials / Generation Z better understand and feel secure with their money. The increased costs of higher education (and their associated student loans), increased cost of living, and skepticism towards social safety nets such as Social Security are contributing factors to feeling anxious about one’s personal finance.
Some industries such as travel prefer credit cards because funds are guaranteed by the Issuing Bank

Interchange

Durbin Amendment - Banks with assets greater than $10 billion are regulated for Interchange fees. Regulated Issuers get $0.21 + 0.05% of the transaction amount + $0.01 for fraud.
Merchants who clear faster qualify for lower Interchange rates.
Track 3 data - Some merchants provide receipt level details (Track 3 Data) to card networks. When this data is provided, lower interchange is charged.
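The regulated debit interchange formula above is easy to work through in code. This is a minimal sketch using the three components as stated; rounding to the nearest cent is my own assumption, and the function name is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP

BASE_FEE = Decimal("0.21")           # flat component per transaction
AD_VALOREM_RATE = Decimal("0.0005")  # 0.05% of the transaction amount
FRAUD_ADJUSTMENT = Decimal("0.01")   # fraud component per transaction


def regulated_interchange(amount):
    """Interchange a regulated Issuer receives on one debit transaction."""
    variable = Decimal(amount) * AD_VALOREM_RATE
    total = BASE_FEE + variable + FRAUD_ADJUSTMENT
    # Assumption: round the result to the nearest cent.
    return total.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


fee_on_100 = regulated_interchange("100.00")    # 0.21 + 0.05 + 0.01
fee_on_1000 = regulated_interchange("1000.00")  # 0.21 + 0.50 + 0.01
```

A $100 purchase earns the regulated Issuer $0.27 and a $1,000 purchase only $0.72, so the fee is nearly flat. That is why the neo-banks mentioned earlier partner with small banks that are not subject to this cap.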

Other solutions for merchants to lower interchange fees:

Private label cards - Some merchants offer their own cards to use at the store. These cards only work at the store that issued the card. Benefits include direct access to customer spending data, reserves spot in customer wallet, no interchange paid to issuer, little or no fees paid to acquirer, brand loyalty.
Co-Brand Cards - Brand partners with an issuing bank and card network. For example, Amazon partnered with Chase and Visa to create an Amazon credit card. Since Amazon arranges this deal, they will typically get lower network assessment fees with the network and lower interchange with the issuing bank. The brand and issuing bank can also earn interchange on transactions outside of brand.

Moving Money without Card Networks

ACH
- Over 82% of electronic payments in the US are ACH (Automated Clearing House)
- ACH is a technology offered by the “Clearing House,” which is a nonprofit organization. This is a network of banks that have come together to enable movement of money interbank through the use of bank account and routing number. This is a batch process.
- Efficient and inexpensive. However, it’s not the fastest. Since it’s a batch process, there are “cutoff windows”
- Direct Deposit - type of ACH transfer that typically comes from an employer into an employee’s bank account. Employers would give Fed NACHA file to transfer the funds. This NACHA file is often sent earlier but funds are only moved on effective date. However, some banks are willing to loan money for 2 days once they see the NACHA file from the employer.
- NACHA is pushing for faster payments by offering Same-Day ACH
Peer-to-Peer and ACH
- Venmo - Venmo, using Plaid, can see the state of the bank account. Venmo can safely float the funds for a couple days as it waits for ACH. However, to the recipient, the funds appear to be sent immediately.
- Zelle - Groups of banks share a ledger. Money moves quickly and does not wait for “settlement”.
Wire Transfer
- Way to move money (usually large dollar amounts) from one bank to another securely and quickly by using account and routing numbers of the sending and receiving banks.
- In the US, the Federal Reserve provides Fedwire which is the primary way to wire funds between banks. This is supported by almost every bank. The Clearing House also provides a wire service called Clearing House Interbank Payments System (CHIPS).
- Movement of money is instant. However, humans need to confirm that money has moved.
Real-time Payments
- Wire transfers are used for large transfers, but can also be applied to smaller payments. Main barrier is that cost of wire is high because humans need to confirm money movement.
- Real-Time Payments (RTP) - way to push money within seconds by sending money directly to a bank account offered by The Clearing House.
- Cost is capped at $0.045 per transfer

How do you create a credit card?

As an engineer working within the Card division at Capital One, I have thoroughly enjoyed learning about the various parts of the payment ecosystem. I don’t know a lot about payments yet, but I am learning more every day.

One observation that I’ve had is that most of the innovation happens on either end of the swipe — closest to the merchant or customer. The underlying core infrastructure often remains stagnant on old technology (however, this is changing too. re: peer-to-peer payments, decentralized ledgers, blockchain). At Capital One, the company is focused on the customer side of payments. One driving question I’ve been asking myself is, “What valuable niches exist in the market and how can Capital One design a credit card for that niche?”

There are many ways to spot niches in the market. You can divide the market up into new to credit, mainstreet, premium, and ultra-premium. You can look at consumer vs. business cardholders. Or, you can also look at the payment market from the merchant side too and examine co-brand opportunities.

Once you have identified the niche, you need to create a monetized product to cater to those consumers. At Capital One, there are two main sources for revenue in credit cards: credit card interest (16.6 billion USD in 2022) and interchange fees (4.6 billion USD in 2022) [2]. The credit card must offer the consumers compelling benefits in order to 1) use the credit card (interchange fees) and 2) revolve and eventually pay off a credit card balance (net interest income).

Now, you need to create the credit card. This involves designing the credit card benefits, advertising the card, handling customer applications, underwriting each application, obtaining capital to loan, and servicing the card. Fortunately, Capital One (and other credit card companies) are platform companies and they have already built a lot of the tools in-house. They benefit from economies of scale. They have low-cost access to capital through their own customer deposits. They have advertising partnerships with top athletes. They have pre-existing web, mobile, and physical channels for applying and servicing a card. Lastly, they are a brand you would turn to for other financial products such as another credit card, a high-yield checking/savings account, or a car loan.

Similar to how AWS has become the platform to power the web, Capital One needs to become the platform for consumer financial products. For software engineers, this means software will play a critical role in 1) automating processes such as handling applications, issuing virtual cards, underwriting each application, analyzing fraud risk, and marketing new cards and 2) analyzing large amounts of data to better underwrite and create additional financial products.

Sources:

[1] The Federal Reserve Payments Study: 2022 Triennial Initial Data Release. https://www.federalreserve.gov/paymentsystems/fr-payments-study.htm

[2] 2022 Capital One Annual Report. https://www.capitalone.com/investor/financials/annual-report/

What I learned working with a Software Engineer with 10+ Years of Experience

2023-12-08T00:00:00-08:00

Today, I worked with Kien Do, a Software Engineer who has over 10 years of experience. Here are four principles I noticed while working with him that have helped me be more effective:

Take your Time: Rushing through a task will only lead to more debugging later. Plan and do it right the first time. This will help you build confidence in your codebase. If needed, communicate to your manager that you need more time to complete the task.
Thoroughness is key: Follow each step diligently and ensure correctness at every stage in development. This includes syntax, understanding system changes, and confirming successful implementation.
Comprehensive Understanding: Think deeply about the system and consider every interaction within the code. Try to save time and effort by addressing potential bugs before they manifest.
Collaboration and Continuous Refinement: Upon completion of work, view the code of other software engineers and see what you can improve. Further, seek to teach others and you will understand your own systems better.

Working with more experienced engineers has helped me learn new things that I cannot learn from reading a textbook or taking a class. These principles likely have come from experience and trial & error. I am appreciative of being able to save some effort and learn directly from these more senior engineers.

Editor Note #1: After showing this post to Kien, he pointed out that with respect to “Comprehensive Understanding”, there is a balancing point between moving fast and fully understanding a system. A good engineer is able to balance these two competing priorities.

Editor Note #2: 9 months later, I have reread this post and can summarize it in two sentences.

Don’t be a code monkey - design before you type (however, don’t overdesign either).
Continuously learn and refine your system.

Vector Databases

2023-08-20T00:00:00-07:00

What are Vector Databases?

Vector databases are databases that store vectors. Their core function is semantic search — if you have an input vector, you can find the top k most similar vectors in the database.
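To make "find the top k most similar vectors" concrete, here is a minimal brute-force sketch using cosine similarity. The tiny three-dimensional vectors and their labels are made up for illustration. Real vector databases typically use approximate nearest-neighbor indexes instead of comparing against every stored vector.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query, vectors, k):
    """Return the k (key, score) pairs most similar to the query, best first."""
    scored = [(key, cosine_similarity(query, vec)) for key, vec in vectors.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Made-up vectors standing in for embeddings of three documents.
store = {
    "cats": [0.9, 0.1, 0.0],
    "dogs": [0.8, 0.2, 0.1],
    "taxes": [0.0, 0.1, 0.9],
}
results = top_k([1.0, 0.0, 0.0], store, k=2)
```

Here the query points in roughly the same direction as "cats" and "dogs", so those two come back first and "taxes" is left out.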

Why Vector Databases?

Large language models (LLMs) are a type of generative AI that generates text. LLMs are trained on vast amounts of text data, but they have two limitations:

LLMs are trained on a lot of data, and compress that vast training data into their limited memory. They lose some information in this compression.
LLMs do not have access to proprietary information. LLMs are only trained on publicly available data.

To fix these two issues, one popular approach has been to feed information the LLM needs into its context [1]. LLMs have “good” reasoning ability [2], and will use this context to craft a more helpful and specific response. Vector databases are used to store this context information.

How can they be used in business to automate process X?

Vector databases are used to give more information to LLMs outside their training data. For business, this often includes their own proprietary data. Let’s walk through a use case.

Assume a business conducts sales through email. They hire salespersons with varying levels of ability. The best salesperson can sell 40% more products than the average employee. As ML engineers, we can design a system to help the average salesperson obtain the sales figures of the best salesperson. Here’s how:

The business has all the data on customer emails and how their best salesperson responds. These pairs of emails are examples that we can use. We insert these email pairs into a vector database. Now, when the business receives an email from a customer, the system can find similar email examples in the vector database, pass them into the LLM’s context as examples, and ask the LLM to generate a new email to respond to the customer using these examples.
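The retrieval step can be sketched in a few lines. This is a toy version under loud assumptions: `embed` is a bag-of-words stand-in for a real embedding model, a plain list stands in for the vector database, and the example emails are invented. The final call to the LLM is left out because it depends on the provider.

```python
import math
from collections import Counter


def embed(text):
    """Toy embedding: word counts. A real system would call an embedding model."""
    return Counter(text.lower().split())


def similarity(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[word] * b[word] for word in a if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_prompt(new_email, examples, k=2):
    """Pick the k most similar past emails and format them as few-shot examples."""
    query = embed(new_email)
    ranked = sorted(examples, key=lambda ex: similarity(query, embed(ex[0])), reverse=True)
    parts = ["Reply to the customer in the style of these examples.", ""]
    for customer_email, reply in ranked[:k]:
        parts += [f"Customer: {customer_email}", f"Reply: {reply}", ""]
    parts += [f"Customer: {new_email}", "Reply:"]
    return "\n".join(parts)


# Invented (customer email, best salesperson reply) pairs.
examples = [
    ("Do you offer bulk pricing on laptops", "Yes, orders over 10 units get a discount."),
    ("My invoice is wrong", "Sorry about that, I will send a corrected invoice."),
    ("What is the warranty on laptops", "Every laptop ships with a one-year warranty."),
]
prompt = build_prompt("Is there bulk pricing for laptops", examples, k=1)
```

The resulting `prompt` contains the bulk-pricing example followed by the new customer email, ready to be sent to whichever LLM the business uses.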

Using this system, every salesperson in the company can be brought up to the level of the best salesperson.

Where can I learn more?

I find LLMs and vector databases fascinating! There are many resources to learn more about them. Here are a few:

If you want to understand the landscape of vector databases and their technical details, check out The Data Quarry’s series of blogs: https://thedataquarry.com/posts/vector-db-1/

If you’re interested in seeing a tutorial for this use case, the YouTube channel AI Jason has an excellent video for this: https://www.youtube.com/watch?v=c_nCjlSB1Zk

Conclusion

Vector databases are used to store additional contextual information for an LLM. This can include information more specific to the problem such as sales data. It is important to note that vector databases are used for more than just LLMs.

LLMs have potential to not only automate processes but also close the skill gap between senior and junior employees. They can also help retain proprietary knowledge when senior employees leave. In business, ML systems can help automate processes and improve the productivity of their employees.

Notes:

[1] The other common approach is fine-tuning.

[2] LLMs are known to make reasoning mistakes. It is still unclear whether current LLMs are capable of true reasoning. However, this will likely improve over time with more data and better models. Check out these research articles: https://arxiv.org/pdf/2212.10403.pdf and https://arxiv.org/pdf/2303.12712.pdf.

Welcome to the new blog

2023-07-15T00:00:00-07:00

Welcome to my new blog! Here, you will find my thoughts and things that I have learned. Enjoy and feel free to reach out with any questions or comments.