Case Study: Sell-Side Algo Credit Trading

Credit and Rates Trading Workflow Automation Best Execution Pricing and Liquidity Risk Analytics

Overview and Current Process

Sell-side dealers and buy-side asset managers are rapidly embracing artificial intelligence applications to price fixed income securities algorithmically in live trading environment or for purposes of end-of-day reconciliation. The current fixed income capital market data flows are inefficient in many respects to enable robust coverage and precision for AI bond pricing. Markets remain heavily reliant on segregated and manual data operations between counterparties creating disparate data sets. These disparate data sets cause the market to suffer from information asymmetry and decentralization. As a result, insight from available data is fragmented and disseminated through manual exchanges between counterparties, which furthers the creation of disparate data sets.

Current Sell-Side Credit Trading Process

Two variations on how the trader receives an RFQ for a particular bond:

  1. Electronic RFQ - Through a Bloomberg (or other) terminal window
    1. Electronic RFQ windows appear on top of each other as per the time when the RFQ was sent, along with a timer to expiration (usually several minutes).
    2. The trader can confirm and submit the price suggested by the RFQ, based on his/her knowledge of the bond market gathered during the day
    3. The trader may also check a third-party pricing source to validate the pricing on the RFQ, or use his/her judgment to adjust the price higher or lower before submitting.
    4. The trader also can choose for some RFQs to expire if there is no interest in quoting back.

  2. >Manual RFQ - Through a phone call
    1. The security is checked on the Bloomberg terminal (or other) with the ISIN, as well as the “Quotes” on the terminal from other dealers, a thought process is used to evaluate the possible price to be quoted, another pricing source might be checked for the specific security or similar securities and a price is given.
    2. The phone call is treated as a priority and the electronically received RFQs are put on hold until a quote has been given.

It is quite evident under time pressure to respond to RFQs that this process entails a trade-off - does the trader let the RFQ expire and sacrifice trading volume as he is not sure of the price or does he sacrifice margins to make sure that he is seen as the market maker for that bond?



Challenges With The Legacy Systems And Processes

When the RFQ is received by the trader via the Bloomberg (or other) terminal or a phone call, there are 4 main methods of pricing the bond:

  1. CBBT taken from Bloomberg covering most of the cases
  2. Fixed Yield – for the special type of the security
  3. Price Range – a certain range is added
  4. A certain spread range – this might be quite wide

The decision on which of these methods should be used, or whether the CBBT price is good would have to be taken by the trader on a case by case basis. A loss of RFQ may occur due to the manual and time-consuming nature of checking fixed yields and certain spread ranges. Comparing the bond that is mentioned in the RFQ, with a similar bond from a peer, would provide good insight into the likely price of the bond, however, this would also take time.


Trader Workflow Objectives And Constraints:

  • The time window within which the trader is supposed to respond to an RFQ is 3 to 5 minutes; however, 1 to 2 minutes is considered ideal.
  • The current cover (difference between accepted trade price and next best quote) is 5 to 10 cents.
  • Trader must deal with approximately 300 to 700 RFQ’s per day
  • On average, around 30%- 40% of RFQs are responded to with a hit rate of 10%– 12%

Reasons driving the problem

The primary problem that the trader faces is due to the low confidence in prices suggested by Bloomberg CBBT and third-party applications that are currently used:

  1. Bloomberg CBBT
  2. The price suggested by Bloomberg CBBT is a composite price based on the most relevant executable quotations on Bloomberg’s Fixed Income Trading platform. The composite is based on current market activity for that bond. If the bond is illiquid, for whatsoever reason, the CBBT price would not represent the true price. This is especially the case when dealing with high yield bonds or bonds with varying liquidity profiles.

  3. Current third-party pricing provider
  4. The pricing methodology is based on a certain pre-defined methodology. The trader’s confidence in this system is low as the pricing methodology is not dynamic and this leads to the suggested pricing is incorrect. As perceived by the trader, there is a large difference in cover (the difference between accepted price and next best available quote) when the deal is won (5-10 cents)

These factors lead to low confidence in the suggested prices and traders must constantly spend a great deal of time and effort in manually adjusting prices based on prior knowledge and intuition. The major trade-off is thus accuracy versus time, leading to missed deals and direct downward pressure on desk P&L.



Can AI-Powered Bond Pricing be a solution?

AI Advantage over Statistical methods

COBI-Pricing, Overbond AI modeling techniques share many similarities with classic statistical modeling techniques starting from the fact that they both deal with volumes data. However, the key difference between statistical techniques and AI models Overbond applies is the goal of these approaches. While statisticians start with a set of known assumptions that are given to the model and best explain the expected behavior of the financial outcome in consideration, AI techniques rather aim at finding by themselves the method (with underlying assumptions that are unknown) that best predicts the outcome in consideration.


Clients can automate trading workflows using Overbond AI bond pricing feeds

Overbond COBI-Pricing, AI bond pricing feed can price bonds automatically with live refresh rates measured in sub 5 second range enabling automated trading workflow. Models can price 30% more bonds with low liquidity profile greatly increasing coverage and pre-trade best execution capabilities. Price is assigned for each bond under the issuer company and spread or yield curve is constructed. COBI-Pricing can systematically price a large number of liquid and illiquid securities, issuer names, and identify pricing tension metrics across large coverage book systematically.

COBI-Pricing was created as part of Overbond’s suite of AI algorithms for the fixed income capital markets. It algorithmically finds the most optimal indicative new issue bond price as well as relative value secondary market bond price for global IG and HY bonds, utilizing machine-learning (ML) algorithms. The ML algorithms analyze millions of data points related to factors such as historical pricing trends between similar bonds and similar issuers, intra-day pricing volatility, trading volume and counterparty composition, company fundamentals trend, investor sentiment, and industry, rating or tenor cluster comparables. Data is aggregated from multiple types of data sources including:


AI Application Business Objectives
Transactions data Transactions occurring in the secondary market, and historical issuance spreads
Investment banking data Fundamentals on corporations, their balance sheet indicators, proprietary data sets treasury groups of the corporations themselves had on file such as dealer quotations and trade points
Proprietary data Client book data and/or direct access to large community of issuers and institutional investors via established feedback loops



Project Structure

Overbond structures sell-side deployment projects in two phases. Overbond first deploys and tests end of day data on a smaller universe of ISINs to algorithmically find the most optimal best executable secondary market bond price for each bond utilizing machine-learning (ML) algorithms. As mentioned before, Overbond ML algorithms analyze millions of data points related to factors such as historical pricing trends between similar bonds and similar issuers, intra-day pricing volatility, trading volume and counterparty composition, company fundamentals trend, investor sentiment, and industry, rating or tenor cluster comparables. Data is aggregated from multiple types of data sources and models are computationally intensive. That is why intra-day pricing is approached as a Phase 2 deliverable of RFQ automation projects.

Phase 1 Deliverables (Duration: 1 month)

  1. Test web access to Overbond’s COBI pricing platform for a predefined list of 100 corporate issuers
  2. Delivery of EOD Mid-Spreads as per the COBI Engine Output for agreed ISIN universe in the form of a table in a flat csv file
  3. Delivery of calculated Bond prices based on the COBI output indicative of the mid price
  4. Error/Difference – Analysis (COBI Price, BBG I-Spreads, Quoted Prices, Traded Prices, Covered).

Phase 2 Deliverables (Duration: 3 months)

  1. Test web access to Overbond’s COBI pricing platform for a predefined list of 100 corporate issuers

    Front end interface developed to incorporate side by side visualization of the output of the finetuned and calibrated intra-day COBI pricing model

    • Intermediate results delivered with all ISINs/securities considered for intra-day pricing, confidence level of the modeled price and liquidity score for each ISIN
    • Input variables, such as aggregated transaction levels, fundamental data, rating and other

  2. Delivery of EOD Mid-Spreads as per the COBI Engine Output for agreed ISIN universe in the form of a table in a flat csv file
    • Bond Buyer Matching module output to serve as an input for the Margin Optimization
    • Modelling trained on the trading books 2 year record
    • Results of AI model for Margin Optimization minimize the Distance-to-Cover for every RFQ on top of the best executable price outputted by COBI-Pricing model


How Overbond COBI-Pricing Algorithm works

The diagram below and the following paragraphs provide a description of ho the Overbond COBI-Pricing algorithm works.


Data Intake & Pre-processing

The Overbond platform sources raw trading live date and fundamental data via Refinitiv’s live platforms. Our data sources include Thompson Reuters, S&P Global Market Intelligence (company level fundamental data) as well as various other sources. Overbond sources primary dealer quotations from a community of large IG issuers and investor preferences through direct feedback loops.

This raw data is then structured in the Overbond databases. Trading data and fundamental data are structured and mapped to the appropriate issuer ID. The data is systematically scrubbed for anomalies and null values. Finally, a set of key input factors are generated based on the raw input. These include but are not limited to factors that measure secondary market spread movements, recent issuance pricing levels, nearest neighbor credit ratings, and fundamental financial metrics. These factors are divided between sector and company-specific and are used as inputs to the machine learning models.

Model Training

The subsequent stage for the machine learning algorithm is to train and apply several models to calculate the output pricing levels. An Ensemble Learning strategy is used in three phases, meaning multiple models are combined to elevate overall robustness at each training stage. These models are each trained using a subset of the past data, ranging from one day, one month to a maximum of ten years. Advanced sampling techniques were used to account for data gaps for illiquid issuers in order to construct yield curves for all tenors and all issuers in coverage universe.



COBI-Pricing Data Intake

Successful data pre-processing is the key stage and pre-requisite for the COBI-Pricing algorithm operation. The precision of the algorithm output is critically dependent on the accuracy, timeliness, and relevance of the pre-processed input data. Overbond sources raw data from major data suppliers in the financial sector, including Refinitiv, Ice, The Six Group, EDI, MarketAxess, Tradeweb, Euroclear, Clearstream, DTCC, CDS, S&P Global Market Intelligence, major credit rating agencies, as well as other sources. The data COBI-Pricing algorithms uses includes the following:


Pre-processed Data Source Update Frequency Relevance
Secondary market transactions data Refinitiv, Ice, The Six Group, EDI, MarketAxess, Tradeweb Intraday Live prices and yields of companies’ bonds are used to measure spread movements in the coverage universe.
Settlement layer data per ISIN liquidity profile Euroclear, Clearstream, DTCC, CDS Intraday, end of day and historical Settlement layer data when adjusted and merged with correct trading time stamp can augment the view into true liquidity of the particular ISIN as it contains settled trades by counterparties that were executed on the OTC level and otherwise do not appear in other consolidated data feeds.
Nearest Neighbour Credit per Rating and Industry Sector Refinitiv, Ice, The Six Group, EDI, MarketAxess, Tradeweb, DBRS, Moody’s, Fitch, S&P Weekly updates, quarterly filing cadence Issuer’s industry sector cluster and past bond issuances and their ratings as well as composite rating for the issuer overall indicate the company’s risk level and benchmarking category. They are used to train the models and to backtest the accuracy of COBI-Pricing output.
Fundamental Financial Metrics S&P Global Market Intelligence Weekly updates, quarterly filing cadence The company’s fundamental financial data is an indicator of the company’s credit-worthiness, and by extension, their cost of borrowing across tenors reflected in the bond pricing. In addition, fundamental metrics indicate the liquidity need of the company and its short term need to raise financing as well as leverage ratios. The financial profile of a company aids with clustering analysis of companies with similar characteristics.


COBI-Pricing Model Training for Different Liquidity Profiles

COBI-pricing is an advanced three-phase AI algorithm engineered to measure best-fit correlations with respect to company fundamental valuation and secondary market pricing for their bonds across sector peers and market conditions at large. Models are tuned for different liquidity scenarios. A variety of pre-processed inputs flow into COBI-Pricing’s algorithm, to generate bond pricing output.


The first phase of the algorithm generates pricing curves for a list of companies specified by a domain expert with highest liquidity profile. This list contains companies from diversified sectors and are frequent issuers with liquid outstanding secondary market bonds (High Issuers). The Issuer should have a minimum number of bonds outstanding, bonds outstanding across the curve, and minimum number of trades and daily volume in secondary market for the algorithm to build a High Issuer curve using SVM algorithm.

The second phase uses a Nearest Neighbors algorithm to generate ISIN pricing curves for issuers with illiquid or insufficient secondaries (Low Issuers). Peers for each Low Issuer are identified using a score based on fundamental financial metrics, industry sector, credit ratings, secondary spreads, and issuances. The top High Issuer with the lowest blended score vis-à-vis a Low Issuer is classified as the peer. The secondary data from the top peer, along with the secondary data from the Low Issuer, is used to form an enhanced dataset for phase three to build curves.

The third phase creates pricing curves for all Low Issuers using Support Vector Regression models within SVM algorithm family, on the combined secondary data set of the Lower Issuer and the top peers as derived from the second phase.


COBI-Pricing handles the problem of sparse data sets, by filling the data gaps using credit-matched peers with pricing levels to arrive at best fit or best executable prices for securities. Illiquid Companies with only minimal trading activity will now have modeled and relative-value prices for secondary market securities across all tenors. Their sparse data sets are enhanced with data from their peers, as determined in phase two of the algorithm.



COBI-Pricing Intra Day User Interface

COBI-Pricing AI output (data-feed) can be refreshed real-time or on an end-of-day basis depending on the user need. COBI-Pricing runs on the big-data set sourced from Overbond data lake. Overbond product team works with clients to customize coverage baskets based on trading style, models are then trained and backtested utilizing all data sources. The COBI-Pricing output can be integrated into a data feed via API, presented as custom visualization, or viewed on the Overbond Platform as a downloadable table.

User Interface – Live Market Pricing:

  1. Primary Market - Standard tenor live visualization
  2. Secondary Market - ISIN pricing live visualization
  3. Issuer curve live visualization


COBI-Pricing Intra Day Data Output

COBI-Pricing AI output via data-feed, API access, can be also refreshed real-time or on an end-of-day basis depending on the user need. COBI-Pricing runs on the big-data set sourced from Overbond data lake. Overbond product team works with clients to customize coverage baskets based on portfolio strategy or trading style, models are then trained and back tested utilizing all data sources. The COBI-Pricing output can be integrated into a data feed via API, presented as custom visualization, or viewed on the Overbond Platform as a downloadable table.

Output Schema for Secondary Market Pricing

Trade Time Issuer ISIN Benchmark (bps) Spread (bps) Yield (bps) Price

Output Visualization for Designated Portfolio

Trade Time Issuer ISIN Benchmark (bps) Spread (bps) Yield (bps) Price
2019-09-02 08:38 Vonovia SE DE000A18V146 -54.52 54.10 -0.42 109.66
2019-09-02 09:51 Deutsche Bahn Finance GMBH XS0969368934 -55.91 17.44 -38.47 111.73
2019-09-02 11:57 Carrefour SA FR0013342128 -56.03 47.73 -8.29 103.39
2019-09-02 12:39 Deutsche Telecome AG XS2024716099 -4.34 100.35 96.01 105.71
2019-09-02 12:41 Vonovia SE DE000A1ZY989 -50.14 63.80 13.66 107.57
2019-09-02 12:48 Iberdrola SA XS1575444622 -51.50 32.10 -19.40 106.32
2019-09-02 13:39 LANXESS AG XS0855167523 -56.89 40.81 -16.08 109.00
2019-09-02 14:07 Electricite de France SA FR0011637586 -57.37 29.82 -27.54 104.19
2019-09-02 14:17 Danone SA FR0013216892 -56.69 18.16 -38.53 100.60
2019-09-02 15:07 LANXESS AG XS1501363425 -57.58 35.62 -21.96 100.87
2019-09-02 15:18 Siemens AG XS2049616464 -57.72 20.71 -37.01 100.75
2019-09-02 15:47 Telenor ASA XS2001737324 -47.70 37.99 -9.71 105.52
2019-09-02 15:51 Electricite de France SA FR0011637586 -57.50 29.98 -27.52 104.19
2019-09-02 16:01 Vonovia SE DE000A19B4D4 -57.89 39.93 -17.96 102.24



Backtest Approach

To establish that Overbond's COBI Bond Pricing algorithm deterministically constructs a fair value curve for a set of coverage issuers and can accurately price ISINs that are within the client’s coverage and with satisfactory precision and coverage. The set of coverage issuers and ISINs has been selected to represent a diversified universe across issuers and ISINs with a liquid day-trading pattern, different ratings/risk profiles, and bonds across curve. Backtest description as per below.

Test Description

1. Measure precision of the fair value curve output from the algorithm by comparing algorithm output prediction for a specific issuer and tenor with actual secondary levels with same specific issuer and tenor bond

2. Test timeframe is all trades that happened in designated 2 time periods as defined (1st Sep 2019 – 15th Oct 2019) and includes a total 500-800 ISINs

3. Test actual traded levels on the day as a most-relevant comparison to modeled level performance by measuring the difference and comparing on a relative value basis difference with average bid-ask spread for that name and tenor (aggregating all trades on the day for the same ISIN/CUSIP)

4. Use reference data from secondary trades in individual bonds selected as a comparison value to the corresponding modeled algorithm price. Do not compare algorithm output to the data-set that was used as algorithm input to preserve integrity in all test cases.



Validation questions back-test should answer:

  1. Does the model perform consistently and deterministically in building credit curves?
  2. Is the accuracy of the model output validated by the fact that the difference between model individual ISIN price output and actual secondary trade is within an acceptable range (ie. 10 cents in price terms)?
  3. Can the model perform well in pricing ISINs with very limited trading activity / liquidity profile?
  4. Does the model construct accurate credit curves for issuers with few outstanding bonds or with illiquid bonds by using Nearest Neighbor AI approach?
  5. Does the Nearest Neighbor algorithm ensure validity of relative value argument across sector and credit quality, effectively guaranteeing that best possible peers /comparables are utilized in pricing benchmarking?
  6. Does the application of Support Vector Regressor (SVR) which uses non-linear regression to build credit curves, effectively ensure that no significant pricing curve distortions exist when longer or shorter tenor pricing is determined across the curve?

In order to test the yields suggested by COBI, the desk defines various metrics and tests that would compare the prices generated from the yields to various reference prices from legacy proprietary systems and Bloomberg. The benchmark set by the sell-side shop is that prices suggested by COBI built curves should be within 10 cents of trader price and ideally within trader price and legacy system suggested price.



Back test visualization approach:

  • COBI-Pricing model provided spreads converted to price through the pricing library and stored them in a local database both with spread and price levels
  • The results are joined with various reference data elements including reference prices, RFQs, Tradebook, Bloomberg CBBT to provide confidence in the way COBI-Pricing functions and can deliver results that can be used and trusted by the trader
  • The results are visualized to allow the stakeholders to interact and understand them

Metrics for error analysis:

Following metrics are used for prices and i-spreads against Tradebook actual trades and quotes, Refinitiv/Bloomberg, RFQ and trades and quotes:

  1. Mean Absolute Error
  2. Mean Squared Error
  3. % within Bid/Ask or Mid/Ask
  4. Number of times trend break was predicted
  5. Error where trend jumps or breaks occur
  6. Performance in Time


Back Test Results

The primary goal of the backtest is to compare prices generated by the pricing engine that uses COBI yields as input and compare them to:

  1. Prices in the trade book (trader adjusted price accepted by the buyer/seller)
  2. Automatic legacy system price (Bloomberg CBBT + Automatic Legacy Adjustment)

The expectation is that prices based on COBI yields should effectively correct the automated system price: i.e. Modeled price – Trader price < Automatic system price – Trader price

In order to visually compare prices, all prices are scaled using automatic system price and scored accordingly. As you can see in the sample graph below displaying results for 18 ISINs, in most cases COBI-Pricing level is within desired blue line range (BBG BID vs. BBG ASK), which is the category that indicates that COBI price is in line with trader’s price.



As we know, not all bonds fit perfectly on an issuer curve, some might be trading off the curve for a number of reasons, while employing a curve-builder approach to price ISINs in the secondary bond market we acknowledge that the model will not be precise for 100% for the entire bond universe. Hence, we segregated the bonds based on various measures including liquidity and volatility to identify bonds which can be priced accurately, and which bonds cannot be due to various market behavior, trading pattern and data availability limitations. COBI-Pricing model output price for each ISIN also has these explainability features attached to it:

  1. Liquidity Score – indicates deep liquidity profile of the ISIN, based on bid-ask spread wideness, trade count and trade volume on the day and in recent history, peer ISIN comparisons, OTC flows from settlement layer data, price volatility intra-day and monitoring of the distribution of all corelated factors
  2. Confidence Score – measures confidence of the modeled COBI-Pricing output price for each ISIN is within a pre-defined threshold, this was set to be within 10 cents on the price basis above
  3. Confidence Interval – same as above but provides a range for trader to consider the high and low point


Business Impact

Over the past couple of years, we have witnessed profound changes in the fixed income marketplace with counterparties increasingly adopting quantitative investing and liquidity risk monitoring techniques to automate their trading workflows. These include systematic algorithmic trading and liquidity risk management automation, merging of fundamental discretionary and quantitative investment styles, consumption of increasing amounts of alternative data, and adoption of new methods of pricing analysis for fixed income instruments such as AI analytics like Overbond COBI-Pricing algorithm.

Specific use cases for the COBI-Pricing algorithm application are examined to identify business objectives and key benefits below. Overbond client organizations include sell-side institutions with significant trading volumes (200-500 RFQs+ a day per trader). Their innovation groups actively explore new technologies that can serve as the catalyst trading automation and improved risk management, trade flow, pre-trade and post-trade analytics.


AI Application Business Objectives Key Benefits
Intelligent automation and enhanced decision-making
  • Automate trade flow and liquidity risk monitoring and reporting

  • Respond to 80-120% more RFQs

  • Preserve optimal hit ratio

  • Significantly increase desk P&L

  • Use auto-pricing models to improve reporting and risk systems

Overbond COBI- Pricing AI models aggregate historical data from multiple sources and optimize pricing for bonds with varying liquidity profiles. Overbond COBI Pricing can enhance users’ bond trading workflow by providing precise executable prices in up to 30% more situations when there is no directly observable trading price in the market. Bond trading execution desk revenue can be grown significantly through their access to this insight.

COBI-Pricing enables fully automated bond trading workflow with various curve visualizations and front-end trade analytics tools that are natively integrated with trader’s desktop.

As a dealer, boost your RFQ hit ratio with pricing feed with margin optimization model add-on, discover deep bond liquidity profile, and predict investor behaviour with buy/sell indicators

Sell-side credit and rates trades can grow their RFQ response volumes significantly with precision and confidence, growing accordingly desk P&L.



Implementation Considerations

Institutions considering AI predictive analytics implementation and big-data transformation projects, can employ acceleration utilizing externally calibrated models and market signals. Below are several key considerations and questions for executives in charge of AI roadmap:

  1. What is the current state of our fixed-income in-house data?
  2. What are our data science and engineering capabilities?
  3. Are we building AI capabilities to grow revenue or cut cost?
  4. How can we redefine the boundaries of our data universe or identify alternative data sources necessary to feed AI engine?
  5. Given that AI learning curve is steep where do we begin?
  6. How do we create and execute AI proof of concept use cases rapidly?
  7. What are key success factors for our AI roadmap?


Custom AI Services
Overbond works with clients to identify and recommend practical AI analytics use cases that are aligned with strategic goals of the financial institution. We help assess current-state AI capabilities, and define roadmap to help clients realise value from AI applications. We manage cross-channel data flows across multiple systems and enable custom font-end visualizations.

Proven Methodology
With our targeted approach and implementation methodology, we quickly demonstrate value of AI analytics to test use cases, enabling client-side change management approach and stakeholder buy-in.

Operational Acceleration
We help clients build and deploy custom AI solutions to deliver proprietary analytics and tangible business outcomes. Our experience combines calibrated models, design patterns, engineering and data science best practices, that accelerate value and reduce implementation risk.

AI Analytics As-a-Service
Overbond helps customers design and oversee mechanisms to optimize and improve existing fixed income credit valuation, issuance and pricing prediction and pre-trade opportunity monitoring using AI. Our team of worldclass data scientists and engineers manage an iterative implementation approach from current state assessment to operational handover.



About Overbond

Overbond specializes in custom AI analytics development for clients implementing trade automation workflows, risk management, portfolio modeling and quantitative finance applications. Overbond supports financial institutions in the AI model development, implementation and validation stages as well as ongoing maintenance.


Contact:
Vuk Magdelinic | CEO
+1 (416) 559-7101
vuk.magdelinic@overbond.com