Creating an Internal R Package: canaR

Lucia Darrow
Dec 9, 2020

Use the same code three times, create a function.

Use the same function across multiple projects, create a package?

At CANA, we use the statistical programming language R across several projects because it makes data-driven applications and reproducible reports easy to build. It’s a great option for exploring data, prototyping solutions, or even taking a model to production. This past summer, several R programmers on the CANA team collaborated on an internal package, canaR, to share standard functions and formatting across our R products. In this post, we share some lessons learned from creating an internal package.

The DRY Principle

Most programmers are familiar with the DRY principle (Don’t Repeat Yourself), which aims to reduce repetition in code. Reasons to follow this principle are abundant: less effort for the programmer, less chance of error across multiple uses of the same code, and streamlined testing.

In R, this translates to the use of functions and packages as a best practice. Packages are a natural unit to distribute code within a team as they include functions, tests, documentation, and vignettes. The package development process has become increasingly accessible in recent years due to tools such as usethis, testthat, and roxygen2.
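As a rough illustration of what one unit of a package looks like, the sketch below shows a single function with roxygen2-style comments above it. The function name `format_pct` and its behavior are hypothetical and are not taken from canaR; in a package, this file would typically sit in the `R/` directory, and roxygen2 would turn the `#'` comments into help pages.

```r
#' Format a proportion as a percentage label
#'
#' Hypothetical helper used only for illustration; it is not a canaR function.
#'
#' @param x Numeric vector of proportions (0.25 means 25%).
#' @param digits Number of decimal places to keep.
#' @return A character vector such as "25.0%".
#' @export
format_pct <- function(x, digits = 1) {
  stopifnot(is.numeric(x))
  # formatC() with format = "f" keeps a fixed number of decimal places
  paste0(formatC(100 * x, format = "f", digits = digits), "%")
}

format_pct(c(0.25, 0.5))  # "25.0%" "50.0%"
```

Writing the label logic once means every report that calls `format_pct()` picks up a fix or a style change at the same time.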

Package Development Approach

The canaR development process was a collaborative effort led by CANA’s R programmers. For our internal package, functions fell into three key categories: style and branding, rounding, and visualizations.

Style and branding functions in the canaR package provide uniformity across our R products by creating a standard for document, table, and plot appearance. These offer a jumping-off point that analysts can tailor as they use the formatting functions and geoms on different projects. The canaR development team worked with CANA’s graphic designer, Koa Beam, to create custom, colorblind-friendly palettes to fit categorical, sequential, and diverging types of data visualizations.
Rounding functions address a crucial and often underestimated challenge in producing standardized results. Working with results from repeated simulation runs, or collaborating with team members who use different software, can lead to some tricky rounding problems. Sample functions in the canaR package that tackle these problems perform actions such as aligning rounding results to what is expected in Excel, or controlling rounding for aggregations.
Visualization functions in canaR create unique data visuals not covered by existing packages. One function that is helpful for working with dateless planning scenarios is the ‘relative timeline,’ created by CANA team member Aaron Luprek. This data visualization function creates a timeline centered on zero, which allows for showing results that are time-based but not tied to a specific date.
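To make the Excel rounding issue concrete: base R's `round()` follows the IEC 60559 standard and rounds halves to the even digit, while Excel's ROUND() rounds halves away from zero. The sketch below is one common way to reproduce the Excel behavior in base R. The name `round_half_away` is hypothetical, and this is not the canaR implementation.

```r
# Round halves away from zero (2.5 -> 3, -2.5 -> -3), as Excel's ROUND() does.
# round_half_away() is a hypothetical name, not the canaR function.
round_half_away <- function(x, digits = 0) {
  scale <- 10^digits
  # The small tolerance absorbs floating-point error: for example,
  # 1.005 * 100 evaluates to slightly less than 100.5.
  tol <- sqrt(.Machine$double.eps)
  sign(x) * floor(abs(x) * scale + 0.5 + tol) / scale
}

halves <- c(0.5, 1.5, 2.5, -2.5)
round(halves)            # base R: 0 2 2 -2
round_half_away(halves)  # Excel-style: 1 2 3 -3
```

The tolerance is a design trade-off: it fixes cases like 1.005 at two digits, at the cost of also rounding up values that sit within about 1.5e-8 below a half.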

Sample Relative Timeline Graphic
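For readers who want a feel for the idea, here is a minimal base-graphics sketch of a timeline centered on zero. It is not the canaR function: the function name, the column names, and the phase data are all assumptions made for illustration.

```r
# Hypothetical phases of a plan, expressed as day offsets from "day 0"
# (illustrative data only).
phases <- data.frame(
  phase = c("Planning", "Staging", "Execution", "Recovery"),
  start = c(-30, -10, 0, 14),
  end   = c(-10, 0, 14, 30)
)

plot_relative_timeline <- function(phases) {
  n <- nrow(phases)
  # Make the x-axis symmetric so zero sits in the middle of the plot
  half_width <- max(abs(c(phases$start, phases$end)))
  phases$row <- rev(seq_len(n))  # first phase is drawn at the top

  plot(0, 0, type = "n",
       xlim = c(-half_width, half_width), ylim = c(0.5, n + 0.5),
       xlab = "Days relative to day 0", ylab = "", yaxt = "n")
  segments(phases$start, phases$row, phases$end, phases$row, lwd = 6)
  abline(v = 0, lty = 2)  # dashed reference line at day 0
  axis(2, at = phases$row, labels = phases$phase, las = 1)

  invisible(phases)  # return the data with its row positions
}

timeline <- plot_relative_timeline(phases)
```

Because the x-axis holds offsets and not calendar dates, the same plot can be reused for any scenario once a "day 0" event is chosen.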

Once the functions were built, the package development team worked together to create clear documentation, examples, and vignettes for future users. We used the testthat package to streamline our testing approach.
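A testthat test for a rounding helper might look roughly like the sketch below. The function under test, `round_half_away`, is a hypothetical stand-in defined inline so the snippet runs by itself; in a package it would live in `R/`, and the tests would live in a file under `tests/testthat/`.

```r
library(testthat)

# Hypothetical function under test (not the canaR implementation).
round_half_away <- function(x, digits = 0) {
  scale <- 10^digits
  sign(x) * floor(abs(x) * scale + 0.5 + sqrt(.Machine$double.eps)) / scale
}

test_that("halves round away from zero", {
  expect_equal(round_half_away(c(0.5, 2.5, -2.5)), c(1, 3, -3))
  expect_equal(round_half_away(1.005, digits = 2), 1.01)
})

test_that("non-numeric input is rejected", {
  expect_error(round_half_away("a"))
})
```

Each `test_that()` block names one behavior, so a failing test points directly at the expectation that broke.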

The final touch was the package name. In the tradition of some of our favorite tidyverse packages, we saw the opportunity to incorporate a little French (à la magrittr) and an animal reference (à la purrr). Hence canaR: a sly reference to ‘canard,’ French for duck, complete with a diverting duck logo.

Resources

Interested in creating your own R package? There are many resources available! Here are just a few that our team found helpful in developing canaR:

Hadley Wickham and Jenny Bryan - R Packages (Book)
Hilary Parker - Writing an R package from scratch (Blog)
Malcolm Barrett - Zen and the Art of R Package Development (Video)
John Muschelli - R Package Development Series (Video)

Lucia Darrow is a Senior Operations Research Analyst at CANA Advisors and can be reached through her LinkedIn profile or via email at ldarrow@canallc.com.
