Smart Pointers - What, Why, Which?

Similar documents
Project. C++: Smart Pointers. The Plan. Announcement. Memory Leak. Pointer Ownership. Today: STL 1 Wednesday: STL 2 Thursday: Smart Pointers

ALL ABOUT POINTERS C/C++ POINTERS

Smart Pointers. Some slides from Internet

CSCI-1200 Data Structures Fall 2011 Lecture 24 Garbage Collection & Smart Pointers

CSCI-1200 Data Structures Spring 2017 Lecture 27 Garbage Collection & Smart Pointers

Memory Leak. C++: Memory Problems. Memory Leak. Memory Leak. Pointer Ownership. Memory Leak

Pointers. Developed By Ms. K.M.Sanghavi

CSCI-1200 Data Structures Spring 2018 Lecture 25 Garbage Collection & Smart Pointers

COMP6771 Advanced C++ Programming

Laboratorio di Tecnologie dell'informazione. Ing. Marco Bertini

G52CPP C++ Programming Lecture 20

Assignment 1: SmartPointer

CS 251 INTERMEDIATE SOFTWARE DESIGN SPRING C ++ Basics Review part 2 Auto pointer, templates, STL algorithms

1/29/2011 AUTO POINTER (AUTO_PTR) INTERMEDIATE SOFTWARE DESIGN SPRING delete ptr might not happen memory leak!

Allocation & Efficiency Generic Containers Notes on Assignment 5

CS 241 Honors Memory

Exception Safety. CS 311 Data Structures and Algorithms Lecture Slides Wednesday, October 28, Glenn G. Chappell. continued

Chapter 17 vector and Free Store. Bjarne Stroustrup

Exception Namespaces C Interoperability Templates. More C++ David Chisnall. March 17, 2011

Cute C++ idioms Nikhil Marathe.

QUIZ Friends class Y;

Chapter 17 vector and Free Store

05-01 Discussion Notes

Assertions and Exceptions

Consider the program...

Chapter 17 vector and Free Store

StackVsHeap SPL/2010 SPL/20

Vector and Free Store (Pointers and Memory Allocation)

CSE 333 Lecture smart pointers

CA341 - Comparative Programming Languages

6 Architecture of C++ programs

Pointers and Memory 1

Garbage Collection Techniques in C++

C++ Programming Lecture 4 Software Engineering Group

vector and Free Store

Exercise 6.2 A generic container class

Memory management COSC346

CSE 333 Lecture smart pointers

Homework 4. Any questions?

QUIZ on Ch.5. Why is it sometimes not a good idea to place the private part of the interface in a header file?

Introducing C++ to Java Programmers

The vector Class and Memory Management Chapter 17. Bjarne Stroustrup Lawrence "Pete" Petersen Walter Daugherity Fall 2007

Chapter 13: Copy Control. Overview. Overview. Overview

Acknowledgements These slides are based on Kathryn McKinley s slides on garbage collection as well as E Christopher Lewis s slides

Introduction to C++ Part II. Søren Debois. Department of Theoretical Computer Science IT University of Copenhagen. September 12th, 2005

Implementing Abstractions


Proposed Wording for Concurrent Data Structures: Hazard Pointer and Read Copy Update (RCU)

G52CPP C++ Programming Lecture 16

Vector and Free Store (Vectors and Arrays)

Cpt S 122 Data Structures. Course Review Midterm Exam # 2

Item 3: Predicates, Part 2: Matters of State

Structure of Programming Languages Lecture 10

Engineering Robust Server Software

COMP6771 Advanced C++ Programming

Creating the C++ auto_ptr<> utility for Symbian OS

Abstract. Vector. Overview. Building from the ground up. Building from the ground up 1/8/10. Chapter 17 vector and Free Store

6. Pointers, Structs, and Arrays. 1. Juli 2011

C++ Primer. CS 148 Autumn

CMSC 330: Organization of Programming Languages. Ownership, References, and Lifetimes in Rust

Lecture 23: Object Lifetime and Garbage Collection

Object-Oriented Programming for Scientific Computing

DYNAMIC MEMORY ALLOCATION ON REAL-TIME LINUX

D Programming Language

Object Oriented Software Design II

In Java we have the keyword null, which is the value of an uninitialized reference type

CSE 100: C++ TEMPLATES AND ITERATORS

CMSC 330: Organization of Programming Languages

One-Slide Summary. Lecture Outine. Automatic Memory Management #1. Why Automatic Memory Management? Garbage Collection.

Chapter 18 Vectors and Arrays [and more on pointers (nmm) ] Bjarne Stroustrup

CSE 333 Lecture smart pointers

A Generic Non-intrusive Smart Pointer Implementation

Memory and C++ Pointers

BYOD - WOCHE 2 AGENDA. Organization. Templates RAII. Smart Pointers

C++ for Java Programmers

Chapter 8 :: Composite Types

A Whirlwind Tour of C++

The issues. Programming in C++ Common storage modes. Static storage in C++ Session 8 Memory Management

CS201 Some Important Definitions

Programming Language Implementation

CSE 333. Lecture 11 - constructor insanity. Hal Perkins Paul G. Allen School of Computer Science & Engineering University of Washington

Performance of Non-Moving Garbage Collectors. Hans-J. Boehm HP Labs

Copy Constructors and Assignment Operators

static CS106L Spring 2009 Handout #21 May 12, 2009 Introduction

CS201 - Introduction to Programming Glossary By

CMSC 330: Organization of Programming Languages. Memory Management and Garbage Collection

You should never see the following pattern (in the sense of textual pattern) in your code:

CS162 - POINTERS. Lecture: Pointers and Dynamic Memory

CMSC 330: Organization of Programming Languages

Written by John Bell for CS 342, Spring 2018

Guidelines for Writing C Code

Lecture 15a Persistent Memory & Shared Pointers

Heap, Variables, References, and Garbage. CS152. Chris Pollett. Oct. 13, 2008.

Overview. Constructors and destructors Virtual functions Single inheritance Multiple inheritance RTTI Templates Exceptions Operator Overloading

Teuchos Utility Classes for Safer Memory Management in C++ Trilinos Users Group Meeting, October 22, 2008

Memory Management Techniques

Short Notes of CS201

No littering! Bjarne Stroustrup. Morgan Stanley, Columbia University

Automatic Memory Management

CSCI 171 Chapter Outlines

Transcription:

Page 1 of 7 Smart Pointers - What, Why, Which? Yonat Sharon What are they? Why would I use them? Less bugs Exception Safety Garbage collection Efficiency STL containers Which one should I use? Local variables Class members STL containers Explicit ownership transfer Big objects Summary Conclusion Source Code Analysis Analyze code structure and uncover must-fix bugs. Get Free trial now! Coverity.com/SourceCodeSolutions Hjemmeside med din Hund Har du en Pointer? Del din hund med venner & familie! www.123hjemmeside.dk What are they? Smart pointers are objects that look and feel like pointers, but are smarter. What does this mean? To look and feel like pointers, smart pointers need to have the same interface that pointers do: they need to support pointer operations like dereferencing (operator *) and indirection (operator ->). An object that looks and feels like something else is called a proxy object, or just proxy. The proxy pattern and its many uses are described in the books Design Patterns and Pattern Oriented Software Architecture. To be smarter than regular pointers, smart pointers need to do things that regular pointers don't. What could these things be? Probably the most common bugs in C++ (and C) are related to pointers and memory management: dangling pointers, memory leaks, allocation failures and other joys. Having a smart pointer take care of these things can save a lot of aspirin... The simplest example of a smart pointer is auto_ptr, which is included in the standard C++ library. You can find it in the header <memory>, or take a look at Scott Meyers' auto_ptr implementation. Here is part of auto_ptr's implementation, to illustrate what it does: template <class T> class auto_ptr T* ptr; public: explicit auto_ptr(t* p = 0) : ptr(p) ~auto_ptr() delete ptr; T& operator*() return *ptr; T* operator->() return ptr; //... ; As you can see, auto_ptr is a simple wrapper around a regular pointer. It forwards all meaningful operations to this pointer (dereferencing and indirection). Its smartness in the destructor: the destructor takes care of deleting the pointer. For the user of auto_ptr, this means that instead of writing:

Page 2 of 7 MyClass* p(new MyClass); You can write: auto_ptr<myclass> p(new MyClass); And trust p to cleanup after itself. What does this buy you? See the next section. Why would I use them? Obviously, different smart pointers offer different reasons for use. Here are some common reasons for using smart pointers in C++. Why: Less bugs Automatic cleanup. As the code above illustrates, using smart pointers that clean after themselves can save a few lines of code. The importance here is not so much in the keystrokes saved, but in reducing the probability for bugs: you don't need to remember to free the pointer, and so there is no chance you will forget about it. Automatic initialization. Another nice thing is that you don't need to initialize the auto_ptr to NULL, since the default constructor does that for you. This is one less thing for the programmer to forget. Dangling pointers. A common pitfall of regular pointers is the dangling pointer: a pointer that points to an object that is already deleted. The following code illustrates this situation: MyClass* p(new MyClass); MyClass* q = p; // Watch out! p is now dangling! p = NULL; // p is no longer dangling q->dosomething(); // Ouch! q is still dangling! For auto_ptr, this is solved by setting its pointer to NULL when it is copied: template <class T> auto_ptr<t>& auto_ptr<t>::operator=(auto_ptr<t>& rhs) if (this!= &rhs) delete ptr; ptr = rhs.ptr; rhs.ptr = NULL; return *this; Other smart pointers may do other things when they are copied. Here are some possible strategies for handling the statement q = p, where p and q are smart pointers:

Page 3 of 7 Create a new copy of the object pointed by p, and have q point to this copy. This strategy is implemented in copied_ptr.h. Ownership transfer: Let both p and q point to the same object, but transfer the responsibility for cleaning up ("ownership") from p to q. This strategy is implemented in owned_ptr.h. Reference counting: Maintain a count of the smart pointers that point to the same object, and delete the object when this count becomes zero. So the statement q = p causes the count of the object pointed by p to increase by one. This strategy is implemented in counted_ptr.h. Scott Meyers offers another reference counting implementation in his book More Effective C++. Reference linking: The same as reference counting, only instead of a count, maintain a circular doubly linked list of all smart pointers that point to the same object. This strategy is implemented in linked_ptr.h. Copy on write: Use reference counting or linking as long as the pointed object is not modified. When it is about to be modified, copy it and modify the copy. This strategy is implemented in cow_ptr.h. All these techniques help in the battle against dangling pointers. Each has each own benefits and liabilities. The Which section of this article discusses the suitability of different smart pointers for various situations. Why: Exception Safety Let's take another look at this simple example: MyClass* p(new MyClass); What happens if DoSomething() throws an exception? All the lines after it will not get executed and p will never get deleted! If we're lucky, this leads only to memory leaks. However, MyClass may free some other resources in its destructor (file handles, threads, transactions, COM references, mutexes) and so not calling it my cause severe resource locks. If we use a smart pointer, however, p will be cleaned up whenever it gets out of scope, whether it was during the normal path of execution or during the stack unwinding caused by throwing an exception. But isn't it possible to write exception safe code with regular pointers? Sure, but it is so painful that I doubt anyone actually does this when there is an alternative. Here is what you would do in this simple case: MyClass* p; try p = new MyClass; catch (...) throw; Now imagine what would happen if we had some if's and for's in there... Why: Garbage collection

Page 4 of 7 Since C++ does not provide automatic garbage collection like some other languages, smart pointers can be used for that purpose. The simplest garbage collection scheme is reference counting or reference linking, but it is quite possible to implement more sophisticated garbage collection schemes with smart pointers. For more information see the garbage collection FAQ. Why: Efficiency Smart pointers can be used to make more efficient use of available memory and to shorten allocation and deallocation time. A common strategy for using memory more efficiently is copy on write (COW). This means that the same object is shared by many COW pointers as long as it is only read and not modified. When some part of the program tries to modify the object ("write"), the COW pointer creates a new copy of the object and modifies this copy instead of the original object. The standard string class is commonly implemented using COW semantics (see the <string> header). string s("hello"); string t = s; t += " there!"; // t and s point to the same buffer of characters // a new buffer is allocated for t before // appending " there!", so s is unchanged. Optimized allocation schemes are possible when you can make some assumptions about the objects to be allocated or the operating environment. For example, you may know that all the objects will have the same size, or that they will all live in a single thread. Although it is possible to implement optimized allocation schemes using class-specific new and delete operators, smart pointers give you the freedom to choose whether to use the optimized scheme for each object, instead of having the scheme set for all objects of a class. It is therefore possible to match the allocation scheme to different operating environments and applications, without modifying the code for the entire class. Why: STL containers The C++ standard library includes a set of containers and algorithms known as the standard template library (STL). STL is designed to be generic (can be used with any kind of object) and efficient (does not incur time overhead compared to alternatives). To achieve these two design goals, STL containers store their objects by value. This means that if you have an STL container that stores objects of class Base, it cannot store of objects of classes derived from Base. class Base /*...*/ ; class Derived : public Base /*...*/ ; Base b; Derived d; vector<base> v; v.push_back(b); // OK v.push_back(d); // error What can you do if you need a collection of objects from different classes? The simplest solution is to have a collection of pointers: vector<base*> v; v.push_back(new Base); v.push_back(new Derived); // OK // OK too // cleanup: for (vector<base*>::iterator i = v.begin(); i!= v.end(); ++i)

Page 5 of 7 delete *i; The problem with this solution is that after you're done with the container, you need to manually cleanup the objects stored in it. This is both error prone and not exception safe. Smart pointers are a possible solution, as illustrated below. (An alternative solution is a smart container, like the one implemented in pointainer.h.) vector< linked_ptr<base> > v; v.push_back(new Base); // OK v.push_back(new Derived); // OK too // cleanup is automatic Since the smart pointer automatically cleans up after itself, there is no need to manually delete the pointed objects. Note: STL containers may copy and delete their elements behind the scenes (for example, when they resize themselves). Therefore, all copies of an element must be equivalent, or the wrong copy may be the one to survive all this copying and deleting. This means that some smart pointers cannot be used within STL containers, specifically the standard auto_ptr and any ownership-transferring pointer. For more info about this issue, see C++ Guru of the Week #25. Which one should I use? Are you confused enough? Well, this summary should help. Which: Local variables The standard auto_ptr is the simplest smart pointer, and it is also, well, standard. If there are no special requirements, you should use it. For local variables, it is usually the right choice. Which: Class members Although you can use auto_ptr as a class member (and save yourself the trouble of freeing objects in the destructor), copying one object to another will nullify the pointer, as illustrated Below. class MyClass auto_ptr<int> p; //... ; MyClass x; // do some meaningful things with x MyClass y = x; // x.p now has a NULL pointer Using a copied pointer instead of auto_ptr solves this problem: the copied object (y) gets a new copy of the member. Note that using a reference counted or reference linked pointer means that if y changes the member, this change will also affect x! Therefore, if you want to save memory, you should use a COW pointer and not a simple reference counted/linked pointer. Which: STL containers As explained above, using garbage-collected pointers with STL containers lets you store objects from

Page 6 of 7 different classes in the same container. It is important to consider the characteristics of the specific garbage collection scheme used. Specifically, reference counting/linking can leak in the case of circular references (i.e., when the pointed object itself contains a counted pointer, which points to an object that contains the original counted pointer). Its advantage over other schemes is that it is both simple to implement and deterministic. The deterministic behavior may be important in some real time systems, where you cannot allow the system to suddenly wait while the garbage collector performs its housekeeping duties. Generally speaking, there are two ways to implement reference counting: intrusive and non-intrusive. Intrusive means that the pointed object itself contains the count. Therefore, you cannot use intrusive reference counting with 3-rd party classes that do not already have this feature. You can, however, derive a new class from the 3-rd party class and add the count to it. Non-intrusive reference counting requires an allocation of a count for each counted object. The counted_ptr.h is an example of non-intrusive reference counting. Reference linking does not require any changes to be made to the pointed objects, nor does it require any additional allocations. A reference linked pointer takes a little more space than a reference counted pointer - just enough to store one or two more pointers. Both reference counting and reference linking require using locks if the pointers are used by more than one thread of execution. Which: Explicit ownership transfer Sometimes, you want to receive a pointer as a function argument, but keep the ownership of this pointer (i.e. the control over its lifetime) to yourself. One way to do this is to use consistent naming-conventions for such cases. Taligent's Guide to Designing Programs recommends using "adopt" to mark that a function adopts ownership of a pointer. Using an owned pointer as the function argument is an explicit statement that the function is taking ownership of the pointer. Which: Big objects If you have objects that take a lot of space, you can save some of this space by using COW pointers. This

Page 7 of 7 way, an object will be copied only when necessary, and shared otherwise. The sharing is implemented using some garbage collection scheme, like reference counting or linking. Which: Summary For this: Local variables Class members STL Containers Explicit ownership transfer Big objects Use that: auto_ptr Copied pointer Garbage collected pointer (e.g. reference counting/linking) Owned pointer Copy on write Conclusion Smart pointers are useful tools for writing safe and efficient code in C++. Like any tool, they should be used with appropriate care, thought and knowledge. For a comprehensive and in depth analysis of the issues concerning smart pointers, I recommend reading Andrei Alexandrescu's chapter about smart pointers in his book Modern C++ Design. Feel free to use my own smart pointers in your code. The Boost C++ libraries include some smart pointers, which are more rigorously tested and actively maintained. Do try them first, if they are appropriate for your needs. Copyright 1999 by Yonat Sharon