CALCULATION ENGINES BEYOND EXCEL: PART 5
In Part 4 we generalized our tool and added a command-line interface (CLI) to simplify the workflow. Along the way we incorporated rate files and built our own parser functions read_flat_csv(), read_ia_py_csv(), and read_gen_rc_ia_py_csv(). This base functionality is a great starting point, however, what happens if we need to add more products? Our current architecture is really only built for a single product and there is not necessarily a best way forward. One approach might be to make a “simple2.py” that is for a second product and just tweak what we need (perhaps point to different rate files). Another might be to add a product parameter to our CLI and at_issue_projection() and inject some control-flow statements. In my experience these work fine if there is not much variability but eventually this approach can become rather confusing, error-prone, and suffer from scalability problems as the decisioning becomes more complex.
Abstracting some of the details becomes our next best friend. Instead of injecting the control-flow statements directly into our projection function we could simply wrap the decisioning into more functions. If premium load rates vary by product just create a function that returns the right rates based on the necessary parameters (likely product). This approach might be entirely sufficient, and what we’re really talking about is answering the question ‘how do I reasonably manage my codebase/tool as the problem complexity grows’ which is a tough question to answer with specifics, but the short version is things will change. How can we minimize changes as we need to introduce new functionality?
Up until now we’ve been focusing on procedural/functional approaches but there is another way to design our solution: objects. Object-oriented programming (OOP) is programming paradigm where we structure our solution around classes and objects. You can think of classes as blueprints for what we want to model, they define a new type of thing that can exist in your program. Objects are particular realizations of those types. As a simple example, I might have a Person class where objects of type Person have a name and an age associated with them and I can create multiple Person objects. Objects generally contain data (a.k.a. fields, attributes, or properties) and/or procedures (a.k.a. methods) and will interact with each other in predefined ways. There is a mountain of resources available on this topic so we’ll focus on an implementation for our problem and you could start with Wikipedia if you wanted to learn more and OOP in general. Our objectives will be code reuse (through inheritance or composition) and scalability (through polymorphism and loose coupling).
By the end of this part, your code should look like this.
AN OBJECT-ORIENTED APPROACH
At this point, your project should look something like the project here. Your local repository may also include a venv directory and a __pycache__ but those are ignored by Git so they do not show up in the stored repository.
Currently, our main script for execution is simple.py and we have a CLI to help run different cases interactively or we could simply leverage the at_issue_illustration() in a separate script if we wanted to run a batch of cases programatically. Suppose we wanted to design another solution using an object-oriented approach because we heard it might help with scalability and code reuse, how can we get started if we know little about this paradigm?
Knowing that the paradigm leverages objects which contain data and procedures and objects with interact with each other in the solution we can create a high-level model to inform our solution design. Let’s start with a simple idea: we want to illustrate a Policy which is a defined by both an Insured and a Product. In a picture:
These will be our objects and we’ll work out the exact relationships between them. Our Insured will house the basic characteristics of the person being insured (i.e. issue age, risk class, and gender), our Product will define any insurance product functionality (e.g., illustration order of operations) and rates, and our Policy will combine these to work together.
Let’s create a new file, objects.py, in the same folder as simple.py and create three placeholder classes:
class Insured:
pass
class Product:
pass
class Policy:
pass
We’ve seen this class syntax before when building tests — we have already been using some object-oriented design! Remember, a class is a blueprint for an object. An object is a specific instance of a class and we create these by instantiating them. A bit of a mouthful, and if you are unfamiliar with this paradigm it should be clear by the end.
Now, let’s dive into these one by one.
INSURED OBJECT
Let’s make Insured a bit more meaningful. Update your Insured class to the following for the following so it can act as a container for important information on the person being insured. This is an initialization function that is called when we try to create an instance of the class:
class Insured:
def __init__(self, gender: str, risk_class: str, issue_age: int):
self.gender = gender
self.risk_class = risk_class
self.issue_age = issue_age
Now we can create an instance of class Insured by calling it similarly to how to call a function. We have to provide the associated parameters otherwise we will get an error with one exception: the self parameter is implicitly provided and represents the particular object being referenced. Then we can access the attributes like this:
>>> import illustrator.objects as iobj
>>> insured = iobj.Insured("M", "NS", 35)
>>> insured.gender
'M'
>>> insured.risk_class
'NS'
>>> insured.issue_age
35
The cool part? We can create multiple Insured objects that remain distinct:
>>> ins1 = iobj.Insured("M", "NS", 35)
>>> ins2 = iobj.Insured("F", "SM", 36)
>>> print(ins1.gender, ins2.gender)
M F
You may have noticed that there is quite a bit of boilerplate code in the Insured class and for something like this, Python provides a streamlined way to create the same thing by leveraging dataclasses. We can replace the initialization method (methods = functions that belong to classes) entirely and do the following:
import dataclasses
@dataclasses.dataclass
class Insured:
gender: str
risk_class: str
issue_age: int
class Product:
...
We use the dataclass decorator from the dataclasses module (@dataclasses.dataclass) which will automatically generate an initialization method for us based on the definitions provided. As a bonus it also automatically generates a __repr__ method which is intended for a string representation of the object and can be useful for debugging or logging purposes. As an example, if we attempted to print an instance of Insured before using the dataclass implementation we would see something like the following:
>>> import illustrator.objects as iobj
>>> insured = iobj.Insured("M", "NS", 35)
>>> print(insured)
<illustrator.objects.Insured object at 0x0000018F7B3CC6E0>
Using the dataclass implementation we would see this which is likely a lot more useful:
Insured(gender='M', risk_class='NS', issue_age=35)
Now we could achieve similar functionality by storing this information in a list, dictionary, or some other data structure but the benefit of creating our own class is that we require this information to be present so we can depend on it existing. If we provide an Insured object to some other code we can rely on gender, risk_class, and issue_age being available but if we used another data structure we might assume or have to do more checking to ensure they exist.
PRODUCT OBJECT
We want to create a Product object that defines or owns the insurance product functionality and rates. Our simple implementation for the illustration would gather the rates based on the case parameters and then run the illustration but let’s separate that functionality here into two different methods for flexibility. To gather rates we need to understand the characteristics of an Insured, so we want a method that accepts an Insured and returns all the requisite rates. Before writing the details of that method, or signature might look something like this:
...
class Product:
def get_rates_for_insured(self, insured: Insured) -> ??:
pass
...
We are not quite sure what we want to return from this method. We want all the applicable rates, so why don’t we create a Rates class to contain them and clearly define what we expect?
RATES OBJECT
Let’s build a container for all our rates and a way to access them. We’ll want our class to contain all the requisite rates for the illustration (e.g., premium loads, per policy fees, etc.). Starting with something straightforward we might do this:
...
class Rates:
def __init__(self,
premium_loads: list[float],
policy_fees: list[float],
per_units: list[float],
naar_discounts: list[float],
coi_rates: list[float],
interest_rates: list[float]):
self.premium_loads = premium_loads
self.policy_fees = policy_fees
self.per_units = per_units
self.naar_discounts = naar_discounts
self.coi_rates = coi_rates
self.interest_rates = interest_rates
...
We could also use the dataclass functionality as well, but before implementing that let’s consider that in simple.py we pass rates by policy year to individual functions. Why don’t we set up our Rates object to easily provide rates by policy year? We’ll do this by indicating the instance variables are protected (the Python convention is to have them start with a single leading underscore so premium_loads -> _premium_loads) and then we’ll add methods to access each rate by policy year.
...
class Rates:
def __init__(self,
premium_loads: list[float],
policy_fees: list[float],
per_units: list[float],
naar_discounts: list[float],
coi_rates: list[float],
interest_rates: list[float]):
self._premium_loads = premium_loads
self._policy_fees = policy_fees
self._per_units = per_units
self._naar_discounts = naar_discounts
self._coi_rates = coi_rates
self._interest_rates = interest_rates
def premium_load(self, policy_year: int) -> float:
return self._premium_loads[policy_year - 1]
def policy_fee(self, policy_year: int) -> float:
return self._policy_fees[policy_year - 1]
def unit_load(self, policy_year: int) -> float:
return self._per_units[policy_year - 1]
def naar_discount(self, policy_year: int) -> float:
return self._naar_discounts[policy_year - 1]
def coi_rate(self, policy_year: int) -> float:
return self._coi_rates[policy_year - 1]
def interest_rate(self, policy_year: int) -> float:
return self._interest_rates[policy_year - 1]
...
Why add all this abstraction? Why not just access the underlying parameters and index with the policy year directly? We could, but then our code would not be very flexible and our design would be considered ‘tightly coupled’ which is typically not good. By encouraging the use of some simple methods we can encapsulate the details of our data structure and change them if need be in the future without impacting the overall functionality. We can change the names of the attributes without impacting the functionality outside the class. We can change how the rates are stored internally in the class (e.g., switch to dictionary or some other custom class) without impacting the functionality outside the class. That can be a very powerful ability.
Now, let’s return to our Product object.
PRODUCT OBJECT
Let’s finish the method we started on, it should return a Rates object. Make sure you import the data_functions code as well to help.
import illustrator.data_functions as df
...
class Product:
def get_rates_for_insured(self, insured: Insured) -> Rates:
premium_loads = df.read_flat_csv('./data/premium_load.csv')
policy_fees = df.read_flat_csv('./data/policy_fee.csv')
unit_loads = df.read_ia_py_csv('./data/unit_load.csv', insured.issue_age)
naar_discounts = df.read_flat_csv('./data/naar_discount.csv', 1)
coi_rates = df.read_gen_rc_ia_py_csv('./data/coi.csv', insured.gender, insured.risk_class, insured.issue_age)
interest_rates = df.read_flat_csv('./data/interest_rate.csv')
rates = Rates(premium_loads, policy_fees, unit_loads,
naar_discounts, coi_rates, interest_rates)
return rates
...
We also need an illustration method. For inputs to that method we’ll need rates, a face amount, a premium, and a way to determine the length of the illustration which will be dependent on the product’s maturity age and the issue age of the insured being illustrated. We can pass an Insured object for the latter and for the former we can add a maturity age attribute to the class during instantiation.
...
import illustrator.functions as functions
...
class Product:
def __init__(self):
self.maturity_age = 121
def get_rates_for_insured(self, insured: Insured) -> Rates:
...
def illustrate_from_issue(self, insured: Insured, face_amount: int, annual_premium: float, rates: Rates) -> float:
projection_years = self.maturity_age - insured.issue_age
end_value = 0
for i in range(12*projection_years):
policy_year = functions.calculate_policy_year(i+1)
start_value = functions.calculate_start_value(end_value)
premium = functions.calculate_premium(i+1, annual_premium)
premium_load = functions.calculate_premium_load(premium, rates.premium_load(policy_year))
expense_charge = functions.calculate_per_policy_fee(rates.policy_fee(policy_year)) + functions.calculate_per_unit_load(rates.unit_load(policy_year), face_amount)
value_for_naar = functions.calculate_value_for_naar(start_value, premium, premium_load, expense_charge)
naar = functions.calculate_naar(face_amount, rates.naar_discount(policy_year), value_for_naar)
coi = functions.calculate_coi(naar, rates.coi_rate(policy_year))
value_for_interest = functions.calculate_value_for_interest(value_for_naar, coi)
interest = functions.calculate_interest(value_for_interest, rates.interest_rate(policy_year))
end_value = functions.calculate_end_value(value_for_interest, interest)
return end_value
Now, with everything implemented we could make a few adjustments. The maturity_age attribute should always be the same for every instance so we could make it a class attribute that is shared across all instances and drop the __init__ function. We could make the rate gathering method a static method and drop the self parameter (it does not call any instance or class attributes). If we make maturity_age a class attribute we could make the illustration function a class method. None of that is required and we’ll skip those changes here, but they can help inform the maintainer and the user what level of data is being accessed.
POLICY OBJECT
Now that we have our Insured and Product objects we can create our Policy object. A Policy should take both an Insured and a Product as well as have a face amount. Rates will be locked in for our policies so we can set those in the initialization method and wrap our illustration method like this:
...
class Policy:
def __init__(self, insured: Insured, product: Product, face_amount: int):
self.insured = insured
self.product = product
self.face_amount = face_amount
self.rates = product.get_rates_for_insured(insured)
def illustrate_from_issue(self, annual_premium: float):
return self.product.illustrate_from_issue(self.insured, self.face_amount, annual_premium, self.rates)
This is an example of using composition instead of inheritance. Composition is generally described as a ‘has-a’ type of relationship; our Policy class has an Insured and it has a Product (plus a face amount). Inheritance is generally described as a ‘is-a’ type of relationship, which we’ll see later. A good rule when designing and utilizing classes is not reaching through objects or limiting what methods have access to. The Law of Demeter or Principle of Least Knowledge can help us and we follow the recommended limitations in our design above. With our classes created we can also port over our CLI to use the object-oriented design:
...
import argparse
...
if __name__ == '__main__':
parser = argparse.ArgumentParser(description="Arguments for simple.py")
parser.add_argument("-g", "--gender", default="M", choices=["M","F"], help="The gender for projection, default is M for male")
parser.add_argument("-r", "--risk_class", default="NS", choices=["NS","SM"], help="The risk class for the projection, default is NS")
parser.add_argument("-i", "--issue_age", default=35, type=int, choices=range(18,81), help="The issue age for the projection, default is 35")
parser.add_argument("-f", "--face_amount", default=100000, type=int, help="The face amount for the projection, default is 100,000")
parser.add_argument("-p", "--premium", default=1255.03, type=float, help="The annual premium for the projection, default is 1,255.03")
args = parser.parse_args()
insured = Insured(args.gender, args.risk_class, args.issue_age)
policy = Policy(insured, Product(), args.face_amount)
end_value = policy.illustrate_from_issue(args.premium)
print(end_value)
Then use the tool nearly the same as when we used simple.py:
(venv) ...\illustrator> python -m illustrator.objects -g M -r NS -i 35 -f 100000 -p 1255.03
132184.0426761172
This is great, but at this point we’ve only duplicated our original implementation and have not demonstrated how we can easily expand functionality. How can we do that? Abstract the Product class. First, let’s create an abstract base class BaseProduct which Product will subclass (e.g., Product will inherit from BaseProduct) and Policy will utilize. By creating an abstract base class we can define a blueprint that cannot be instantiated directly and must be finished so to speak by any subclass that inherits from the base class. Our subclasses of BaseProduct will be our actual products we want to model, generally referred to as concrete classes.
...
import abc
...
class BaseProduct(abc.ABC):
@abc.abstractmethod
def get_rates_for_insured(self, insured: Insured) -> Rates:
pass
@abc.abstractmethod
def illustrate_from_issue(self, insured: Insured, face_amount: int, annual_premium: float, rates: Rates) -> float:
pass
...
class Product(BaseProduct):
...
class Policy:
def __init__(self, insured: Insured, product: BaseProduct, face_amount: int):
...
The decorator @abc.abstractmethod above our two methods is the bit of functionality we include that forces the subclasses to implement the methods. This decorator combined with inheriting from abc.ABC will prevent Python from creating an object that has not implemented the decorated method. Our concrete product Product inherits from BaseProduct so the implementation responsibility is still required, however we know it is already fulfilled. We also tweak our Policy object by indicating the class will take a BaseProduct object which might be counter-intuitive. What this really means is that the Policy class requires an object of class BaseProduct or a subclass of BaseProduct. We build our Policy object functionality around the abstract BaseProduct and ignore the details of our concrete class Product. Now if we wanted to add another product, say Product2, we could have that be a subclass of BaseProduct, implement the two necessary methods, and use objects of class Product2 in a Policy object with no problems! This way we can isolate the functionality that changes between products without changing much else in our architecture!
A REFRESHED VIEW OF THE ARCHITECTURE
Our current architecture can be visualized in the following diagram (note: this is a very simple UML diagram and I am intentionally excluding methods and attributes).
Here we have that the Policy class is composed of the Insured and BaseProduct classes and has an association with the Rates class. The BaseProduct class has a dependency with Rates and is implemented by Product. We are treating BaseProduct as more of an interface. With this diagram in mind, it might now look obvious how we would add more products to our architecture, add more implementations of BaseProduct and nothing else changes in the diagram!
Update your README.md to reflect the separate execution instructions for the object-oriented solution and don’t forget to save your files, commit to your Git repo, and push to your online repository! We leave it to you to add tests for the new architecture as well.
If you are having difficulty with the code please refer to the v0.5 tagged version of this repository.
Have questions or feedback? Reach out to let us know!