A short exploration of Solana MEV data

Ghost is a company that allows developers to create custom blockchain indexes using Solidity. They recently published a dataset that contains 1.06m Solana transactions from September 2024 that they identified as atomic arbitrage* transactions containing SOL/WSOL. This article quickly analyzes that dataset. Thanks to Ghost for making this available!

*Atomic arbitrage is a type of arbitrage that takes place within a single set of transactions. For example, someone may notice that the SOL <> USDC and USDC <> USDT and USDT <> SOL prices provide an arbitrage opportunity, and closes it in a single block/slot. This would NOT include CEX-DEX arbitrage, which does not/cannot occur atomically.

The notebook for this article is available on my Github.

Background and Dataset Schema

As some background on the Solana network operation/MEV landscape, Jito Labs develops an alternative Solana node implementation that adds new features to the original “Agave” client developed by Solana Labs (now Anza). The most relevant features for this article are the ability to submit transaction “bundles,” where the client executes transactions received sequentially and atomically, and add Jito tips to transactions, which are side payments for priority inclusion.

Solana network validators can choose to run the Jito client rather than the Agave client, if they want to offer these features. They can also modify the client themselves, as it is open source. Likely because validators earn more revenue when they run the Jito client, 89% of Solana stake weight runs the Jito client as of 10/21/24, according to Jito. This makes the Jito client the de facto primary client implementation for the Solana network. You can see in the Jito commit history that the Anza team contribute to Jito development.

While the Agave client has a concept of a priority fee, the need for Jito tips is described well here. At a high level, Jito takes 5% of tips (corroboration), validators take a discretionary commission (market commission appears to be 7-10% according to Stakewiz), and SOL stakers receive the rest. (as an aside, I believe validators can choose what txns to include. I don’t think consensus requires inclusion based on priority fee, or intrablock ordering based on priority fee).

With that background, here is the schema for Ghost’s dataset, with some analogies to EVM transactions, if helpful.

FieldDescription
sigSignature (akin to tx hash)
slotSlot number (akin to block number)
date_cetDate in Central European Time
in_bundleWhether the transaction is part of a Jito “bundle”
jito_tip_solThe tip to the validator running the Jito client included with the transaction, in SOL. Akin to a priority fee.
revenue_solThe revenue from the arbitrage, calculated by Ghost
signerThe signer of the txn, or the wallet that sent the transaction.
programThe smart contract (called a “program” on Solana) with which the transaction interacted.

Note that everything in this article simply takes the dataset at face value. It may be possible that what is in the dataset may not be strictly the same as what happened on Solana itself (i.e., if the dataset is not accurate for some reason). That said, I have no reason to think the dataset is inaccurate! I just don’t know 🙂

Basic interesting facts

  • The total amount of atomic arbitrage revenue earned by arbitrageurs was 26,991 $SOL, or about $4.5m at current day constant price (10/22/24).
  • The total amount of tips paid to validators running the Jito client was 5,141 $SOL, or about $856k at current day prices.
  • This equates to just 19% of arbitrage opportunity paid to Jito proposers, which may indicate a relatively uncompetitive market. If the market were more competitive, proposers should receive more of the arbitrage opportunity because the slot leader is a monopolist on the economic opportunity.
  • Additionally, just 33.7% of the transactions are included in a Jito bundle at all. I know that it is common practice for searchers to send a Jito transaction AND an Agave transaction, but this is still surprising, especially given Jito’s 89% market share.
  • For transactions sent to Jito, 47.6% of arbitrage opportunity is paid to Jito proposers.
  • The total revenue from transactions in a bundle was 10,800 $SOL, or about $1.8m. This is equivalent to 40% of arbitrage revenue.

These datapoints indicate an interesting and surprising market structure in Solana atomic arbitrage. First, it appears relatively uncompetitive. If the market were more competitive, arbitrageurs would need to tip closer to the total arbitrage profit for a transaction (or else a competitor would tip higher and capture the opportunity). Second, although Jito has high penetration, and a product feature that seems great for atomic arbitrage (bundles), it doesn’t end up being used as much as one might think.

Short exam of opportunities

  • There are 52 unique programs (i.e., smart contracts/apps) present in the dataset. A Solana “program” is like a smart contract in the EVM.
  • The most-used program is JUP6Lk..., the Jupiter aggregator. It was arbitraged against 307k times (1/3rd of txns).
  • The next-most arbitraged-against program is E8uB9x..., with 99k transactions (1/10th of txns).

Here’s a histogram showing the number of transactions per program in the dataset. It’s hard to say that much about these stats.

And some additional scatter plots for number of transactions vs total revenue and total tips per program.

And probably the most interesting scatter, total tips vs total revenue.

Low tips but high revenue are probably the best programs for an arbitrageur; it implies they are capturing most of the arbitrage value available. The “best” programs, circled in red above (the three which yielded >1200 SOL in revenue but required <250 SOL in tips) are described below.

ProgramJito tips paid to validator ($SOL)Total revenue ($SOL)Aggregate % opportunity paid as tipNumber of txns
3JmzqBoDLvNTPapBGCN7x23kTE5o7zkQ2fQhuyU3j9x6153.12,493.16.1%28,090
7PWnthtTsGnSpR4JLENYVoCJ5y5XwgELVbqgp6TkAFaH02,241.50%38,406
bank7GaK8LkjyrLpSZjGuXL8z7yae6JqbunEEnU9FS425.41,339.11.9%95,013
All bundled txns5,14110,80047.6%356,867
Full dataset5,14126,99119%1,059,953

Another interesting finding is that Jito use for these opportunities is relatively low.

ProgramAggregate % opportunity paid as tip% txns in Jito bundle
3Jmzq...6.1%14.5%
7PWnt...0%0%
bank7...1.9%8.3%
Full dataset19%33.7%

One hypothesis for this phenomenon is that just one market participant is running the strategy. And indeed that’s what we find:

ProgramNumber of unique transaction senders
3Jmzq...3
7PWnt...1
bank7...1
Full dataset414

Except for 3Jmzq..., which has three unique senders in the month of September. Did competitors discover the strategy during the month? Here’s the number of unique senders over time:

Doesn’t seem like it. Hmm, let’s see how each signer/sender is performing.

Again, pretty consistent! No big changes in the market for this opportunity over time.

The last thing I wanted to check that might indicate a change in competitiveness was increased use of Jito tips. If the amount of tips increased over time, that could indicate the opportunity set was growing more competitive.

So interesting! 9VP2W... (must?) uses tips to win its opportunities… and it still wins the least of the three addresses!

As to what the program actually does? I have absolutely no idea, I cannot read a Solana block explorer.

Thanks for reading 🙂 These are my independent thoughts and do not necessarily represent the views of my employer.