Lakos Large Scale C++

The document discusses the top 10 concepts that every software engineer should know. It summarizes each concept in 1-2 paragraphs. The concepts are: interfaces, conventions and templates, layering, algorithmic complexity, hashing, caching, concurrency, cloud computing, security, and relational databases. Each concept is a fundamental area of knowledge that software engineers need to be familiar with to design and build software systems.

Uploaded by

api-1752250

Top 10 Concepts That Every Software Engineer Should Know
Written by Alex Iskold / July 22, 2008 8:21 PM / 48 Comments

The future of software development is about good craftsmen. With infrastructure
like Amazon Web Services and an abundance of basic libraries, it no longer takes a village to build a
good piece of software.

These days, a couple of engineers who know what they are doing can deliver complete systems. In this
post, we discuss the top 10 concepts software engineers should know to achieve that.

A successful software engineer knows and uses design patterns, actively refactors code, writes unit tests
and religiously seeks simplicity. Beyond the basic methods, there are concepts that good software
engineers know about. These transcend programming languages and projects - they are not design
patterns, but rather broad areas that you need to be familiar with. The top 10 concepts are:

1. Interfaces
2. Conventions and Templates
3. Layering
4. Algorithmic Complexity
5. Hashing
6. Caching
7. Concurrency
8. Cloud Computing
9. Security
10. Relational Databases

10. Relational Databases

Relational databases have recently been getting a bad name because they do not
scale well to support massive web services. Yet the relational database is one of the most
fundamental achievements in computing; it has carried us for two decades and
will remain for a long time. Relational databases are excellent for order
management systems, corporate databases and P&L data.

At the core of the relational database is the concept of representing information in
records. Each record is added to a table, which defines the type of information.
The database offers a way to search the records using a query language, nowadays
SQL. It also offers a way to correlate information from multiple tables (joins).

The technique of data normalization is about correct ways of partitioning the data among tables to
minimize data redundancy and maximize the speed of retrieval.
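
To make tables, queries and joins concrete, here is a minimal sketch using Python's built-in sqlite3 module. The customers/orders schema and the sample rows are assumptions made up for illustration; the point is that each customer's name is stored once and correlated with orders through a join.

```python
import sqlite3

# Hypothetical schema: customers and orders live in separate tables, so a
# customer's name is stored exactly once (a normalized layout).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE orders (
        id INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customers(id),
        amount REAL NOT NULL
    );
    INSERT INTO customers (id, name) VALUES (1, 'Ada'), (2, 'Grace');
    INSERT INTO orders (customer_id, amount) VALUES (1, 10.0), (1, 5.5), (2, 7.0);
""")

# The JOIN correlates records from both tables; GROUP BY totals per customer.
totals = conn.execute("""
    SELECT c.name, SUM(o.amount)
    FROM customers AS c
    JOIN orders AS o ON o.customer_id = c.id
    GROUP BY c.id
    ORDER BY c.name
""").fetchall()
conn.close()
```

Had the customer name been copied into every order row, renaming a customer would mean updating many records; keeping it in its own table is the kind of partitioning that normalization is about.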

9. Security
With the rise of hacking and growing data sensitivity, security is paramount. Security is
a broad topic that includes authentication, authorization, and information
transmission.

Authentication is about verifying user identity. A typical website prompts for a
password. The authentication typically happens over SSL (Secure Sockets Layer), which
encrypts the HTTP traffic between the browser and the server. Authorization is about
permissions and is important in corporate systems, particularly those that define
workflows. The recently developed OAuth protocol lets users grant web services
access to their private information without handing over their passwords. This is how Flickr permits
access to individual photos or data sets.
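
As a rough illustration of the password check behind authentication, here is a sketch using only Python's standard library. The function names and the iteration count are assumptions for this example, and it is not a description of how any particular site does it; a real system would more likely rely on a vetted authentication library.

```python
import hashlib
import hmac
import os
from typing import Optional, Tuple

ITERATIONS = 100_000  # assumed work factor; tune it for your own hardware


def hash_password(password: str, salt: Optional[bytes] = None) -> Tuple[bytes, bytes]:
    """Return (salt, digest). Store these instead of the plain password."""
    if salt is None:
        salt = os.urandom(16)  # a fresh random salt per user
    digest = hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, ITERATIONS)
    return salt, digest


def verify_password(password: str, salt: bytes, expected: bytes) -> bool:
    """Re-derive the digest from the attempt and compare in constant time."""
    _, digest = hash_password(password, salt)
    return hmac.compare_digest(digest, expected)
```

At sign-up you would call hash_password and store the salt and digest; at login, verify_password tells you whether the attempt matches.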

Another security area is network protection. This concerns operating systems, configuration and
monitoring to thwart hackers. The network is not the only thing that is vulnerable; any piece of software is. Even the Firefox browser,
marketed as the most secure, has to patch its code continuously. Writing secure code for your system
requires understanding its specifics and potential problems.

8. Cloud Computing
In our recent post, Reaching For The Sky Through Compute Clouds, we talked
about how commodity cloud computing is changing the way we deliver
large-scale web applications. Massively parallel, cheap cloud computing reduces both
costs and time to market.

Cloud computing grew out of parallel computing, the idea that many problems
can be solved faster by running the computations in parallel.

After parallel algorithms came grid computing, which ran parallel computations
on idle desktops. One of the first examples was the SETI@home project out of
Berkeley, which used spare CPU cycles to crunch data coming from space. Grid
computing is widely adopted by financial companies, which run massive risk calculations. The concept of
under-utilized resources, together with the rise of the J2EE platform, gave rise to the precursor of cloud
computing: application server virtualization. The idea was to run applications on demand and change
what is available depending on the time of day and user activity.

Today's most vivid example of cloud computing is Amazon Web Services, a package available via API.
Amazon's offering includes a compute service (EC2), a storage service for storing and serving large media files
(S3), a simple database service (SimpleDB), and a queue service (SQS). These first building blocks already enable
an unprecedented way of doing large-scale computing, and surely the best is yet to come.

7. Concurrency
Concurrency is one topic engineers notoriously get wrong, and understandably so:
although the brain juggles many things at a time, schools emphasize linear
thinking. Yet concurrency is important in any modern system.

Concurrency is about parallelism, but inside the application. Most modern
languages have a built-in concept of concurrency; in Java, it's implemented using
Threads.

A classic concurrency example is the producer/consumer, where the producer
generates data or tasks and hands them to worker threads to consume and execute.
The complexity in concurrent programming stems from the fact that threads often
need to operate on common data. Each thread has its own sequence of execution, but accesses
the same data. One of the most sophisticated concurrency libraries (java.util.concurrent) was developed by Doug Lea and
is now part of core Java.
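
The producer/consumer pattern can be sketched in a few lines. The version below uses Python's threading and queue modules rather than Java; the function names, the squaring "work" and the None sentinel are assumptions chosen for illustration. The shared results list is the common data, so access to it is guarded by a lock.

```python
import queue
import threading


def producer(tasks, n_items, n_workers):
    """Generate tasks, then one None sentinel per worker to signal the end."""
    for i in range(n_items):
        tasks.put(i)
    for _ in range(n_workers):
        tasks.put(None)


def worker(tasks, results, lock):
    """Consume tasks until the sentinel arrives; append results under a lock."""
    while True:
        item = tasks.get()
        if item is None:
            break
        with lock:  # the results list is shared by all workers
            results.append(item * item)


def run(n_items=10, n_workers=3):
    tasks = queue.Queue()  # thread-safe hand-off between producer and workers
    results = []
    lock = threading.Lock()
    workers = [
        threading.Thread(target=worker, args=(tasks, results, lock))
        for _ in range(n_workers)
    ]
    for t in workers:
        t.start()
    producer(tasks, n_items, n_workers)
    for t in workers:
        t.join()
    return sorted(results)  # completion order varies from run to run
```

The queue does the hard synchronization work here; the order in which workers finish is not deterministic, which is why the results are sorted before being returned.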

6. Caching
No modern web system runs without a cache, which is an in-memory store that
holds a subset of information typically stored in the database. The need for a cache
comes from the fact that generating results from the database is costly. For
example, if you have a website that lists books that were popular last week, you'd
want to compute this information once and place it in the cache. User requests then fetch
data from the cache instead of hitting the database and regenerating the same
information.

Caching comes with a cost. Only a subset of the information can be stored in
memory. The most common data pruning strategy is to evict the items that were least
recently used (LRU). The pruning needs to be efficient, so that it does not slow down the
application.
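
Here is one way LRU eviction might look, sketched on top of Python's OrderedDict. The class and method names are made up for this example; both get and put run in constant time on average, which keeps the pruning cheap.

```python
from collections import OrderedDict


class LRUCache:
    """A small least-recently-used cache (illustrative sketch)."""

    def __init__(self, capacity):
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self._items = OrderedDict()  # oldest entry first, newest last

    def get(self, key, default=None):
        if key not in self._items:
            return default
        self._items.move_to_end(key)  # mark as most recently used
        return self._items[key]

    def put(self, key, value):
        self._items[key] = value
        self._items.move_to_end(key)
        if len(self._items) > self.capacity:
            self._items.popitem(last=False)  # evict the least recently used
```

With a capacity of two, storing a third item evicts whichever of the first two was touched least recently.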

A lot of modern web applications, including Facebook, rely on a distributed caching system called
Memcached, developed by Brad Fitzpatrick while working on LiveJournal. The idea was to create a
caching system that utilises spare memory capacity on the network. Today, there are Memcached libraries
for many popular languages, including Java and PHP.

5. Hashing
The idea behind hashing is fast access to data. If the data is stored sequentially,
the time to find an item is proportional to the size of the list. A hash table avoids
the scan: for each element, a hash function calculates a number, which is used as an
index into the table. Given a good hash function that spreads data uniformly across
the table, the look-up time is constant on average. Perfect hashing is difficult to
achieve, so hash table implementations support collision resolution.
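
To show what collision resolution means in practice, here is a toy hash table that uses separate chaining, one of several possible strategies. The class name and bucket count are assumptions for the sketch; Python's own dict is implemented differently.

```python
class ChainedHashTable:
    """Toy hash table: each bucket is a list of (key, value) pairs."""

    def __init__(self, n_buckets=8):
        self._buckets = [[] for _ in range(n_buckets)]

    def _bucket(self, key):
        # The hash value, reduced modulo the table size, picks the bucket.
        return self._buckets[hash(key) % len(self._buckets)]

    def put(self, key, value):
        bucket = self._bucket(key)
        for i, (existing_key, _) in enumerate(bucket):
            if existing_key == key:
                bucket[i] = (key, value)  # overwrite an existing entry
                return
        bucket.append((key, value))  # colliding keys share the bucket

    def get(self, key, default=None):
        for existing_key, value in self._bucket(key):
            if existing_key == key:
                return value
        return default
```

With a single bucket every key collides and look-ups degrade to a linear scan, which is exactly why a hash function that spreads keys evenly matters.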

Beyond the basic storage of data, hashes are also important in distributed systems.
The so-called uniform hash is used to allocate data evenly among computers in a
cloud database. A flavor of this technique is part of Google's indexing service;
each URL is hashed to a particular computer. Memcached similarly uses a hash
function.
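
The idea of hashing a key to a particular machine can be sketched as follows. This is the simplest modulo-based placement, with hypothetical server names; it is not a description of what Google or Memcached clients actually do.

```python
import hashlib
from typing import Sequence


def server_for(key: str, servers: Sequence[str]) -> str:
    """Map a key to one server using a stable hash of the key."""
    # hashlib gives the same digest in every process, unlike Python's
    # built-in hash(), which is randomized per run for strings.
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    index = int.from_bytes(digest[:8], "big") % len(servers)
    return servers[index]
```

One known weakness of plain modulo placement is that adding or removing a server remaps most keys; schemes such as consistent hashing exist to reduce that churn.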

Hash functions can be complex and sophisticated, but modern libraries have good defaults. The important
thing is to understand how hashes work and how to tune them for maximum performance benefit.

4. Algorithmic Complexity
There are just a handful of things engineers must know about algorithmic
complexity. First is big O notation. If something takes O(n), it is linear in the size
of the data; O(n^2) is quadratic. Using this notation, you should know that searching
through a list is O(n) and binary search (through a sorted list) is O(log n).
Sorting n items by comparison takes O(n log n) time.
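
The difference between O(n) and O(log n) search is easy to see side by side. The sketch below uses Python's bisect module for the binary search; the function names are chosen for this example.

```python
from bisect import bisect_left


def linear_search(items, target):
    """O(n): inspect each element in turn."""
    for i, item in enumerate(items):
        if item == target:
            return i
    return -1


def binary_search(sorted_items, target):
    """O(log n): halve the search range each step; input must be sorted."""
    i = bisect_left(sorted_items, target)
    if i < len(sorted_items) and sorted_items[i] == target:
        return i
    return -1
```

On a million sorted items the linear version may inspect every element, while the binary version needs about twenty comparisons.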

Your code should (almost) never have multiple nested loops (a loop inside a loop inside a loop). Most of
the code written today should use hash tables, simple lists and singly nested loops.
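
As an illustration of trading a nested loop for a hash table, consider checking whether any two numbers in a list add up to a target. The problem and the function names are assumptions picked for this sketch.

```python
def has_pair_with_sum_quadratic(numbers, target):
    """O(n^2): compare every pair with two nested loops."""
    for i in range(len(numbers)):
        for j in range(i + 1, len(numbers)):
            if numbers[i] + numbers[j] == target:
                return True
    return False


def has_pair_with_sum_linear(numbers, target):
    """O(n) on average: remember the values seen so far in a hash set."""
    seen = set()
    for n in numbers:
        if target - n in seen:  # constant-time membership test
            return True
        seen.add(n)
    return False
```

Both functions return the same answers; the second does it in a single pass at the cost of some extra memory.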

Due to the abundance of excellent libraries, we are not as focused on efficiency these days. That's fine, as
tuning can happen later on, after you get the design right.

Elegant algorithms and performance are things you shouldn't ignore. Writing compact and readable
code helps ensure your algorithms are clean and simple.

3. Layering
Layering is probably the simplest way to discuss software architecture. It first got
serious attention when John Lakos published his book on large-scale C++
systems, Large-Scale C++ Software Design, which argued that software consists of
layers. The method is this: for each software component, count the
number of other components it relies on. That count is the metric of how complex the
component is.

Lakos contended that good software follows the shape of a pyramid; i.e., there's a
progressive increase in the cumulative complexity of each component, but not
in the immediate complexity. Put differently, a good software system consists of
small, reusable building blocks, each carrying its own responsibility. In a good
system, no cyclic dependencies between components are present and the whole system is a stack of layers
of functionality, forming a pyramid.
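
The dependency-counting idea can be sketched on a small dependency graph. The component names are hypothetical, and numbering the bottom layer as level 0 is a convention chosen for this example rather than taken from the book.

```python
def all_dependencies(deps, name):
    """Every component that `name` relies on, directly or indirectly."""
    seen = set()
    stack = list(deps.get(name, ()))
    while stack:
        dep = stack.pop()
        if dep not in seen:
            seen.add(dep)
            stack.extend(deps.get(dep, ()))
    return seen


def levelize(deps):
    """Assign each component a layer: 0 if it depends on nothing, otherwise
    one more than its highest dependency. Raises ValueError on a cycle."""
    levels = {}
    in_progress = set()

    def visit(name):
        if name in levels:
            return levels[name]
        if name in in_progress:
            raise ValueError("cyclic dependency involving %r" % name)
        in_progress.add(name)
        level = 1 + max((visit(d) for d in deps.get(name, ())), default=-1)
        in_progress.discard(name)
        levels[name] = level
        return level

    for name in deps:
        visit(name)
    return levels
```

In a layered system every component gets a level and the cumulative dependency count grows gradually toward the top; a cycle makes levelization impossible, which is why it is reported as an error.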

Lakos's work was a precursor to many developments in software engineering, most notably Refactoring.
The idea behind refactoring is continuously sculpting the software to ensure it is structurally sound and
flexible. Another major contribution was by Dr Robert Martin from Object Mentor, who wrote about
dependencies and acyclic architectures.

Among the tools that help engineers deal with system architecture are Structure 101, developed by Headway
Software, and SA4J, developed by my former company, Information Laboratory, and now available from
IBM.

2. Conventions and Templates

Naming conventions and basic templates are the most overlooked software
patterns, yet probably the most powerful.

Naming conventions enable software automation. For example, the Java Beans
framework is based on a simple naming convention for getters and setters. And
canonical URLs in del.icio.us, such as http://del.icio.us/tag/software, take the user to the
page that has all items tagged software.

Many social software applications use naming conventions in a similar way. For example, if
your user name is johnsmith then likely your avatar is johnsmith.jpg and your RSS
feed is johnsmith.xml.

Naming conventions are also used in testing; for example, JUnit automatically recognizes all the methods
in a class that start with the prefix test.
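
A convention like the test prefix is simple to automate. The following sketch mimics the idea in Python; the CartTests class and the helper functions are hypothetical and this is not how JUnit itself is implemented.

```python
class CartTests:
    """Hypothetical test class: only methods named test* should be run."""

    def test_empty_cart_total(self):
        assert sum([]) == 0

    def test_single_item_total(self):
        assert sum([5]) == 5

    def helper(self):
        raise RuntimeError("not a test; must never be called by the runner")


def discover_tests(cls, prefix="test"):
    """Find callable attributes whose names start with the agreed prefix."""
    return sorted(
        name
        for name in dir(cls)
        if name.startswith(prefix) and callable(getattr(cls, name))
    )


def run_tests(cls):
    """Run every discovered test on a fresh instance; return their names."""
    instance = cls()
    names = discover_tests(cls)
    for name in names:
        getattr(instance, name)()
    return names
```

No registration or configuration is needed: following the naming convention is enough for the runner to pick a method up.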

The templates we mean here are not C++ or Java language constructs. We're talking about template files that contain
variables and then allow binding of objects, resolution, and rendering the result for the client.

ColdFusion was one of the first to popularize templates for web applications. Java followed with JSPs,
and recently Apache developed a handy general-purpose templating engine for Java called Velocity. PHP can be
used as its own templating engine because it supports the eval function (be careful with security). For XML
programming, it is standard to use the XSL language for templates.

From generation of HTML pages to sending standardized support emails, templates are an essential
helper in any modern software system.
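
A template with variables that are bound at render time can be sketched with Python's standard string.Template class. The support email wording and the field names are assumptions for this example.

```python
from string import Template

# Hypothetical support email; $name, $ticket_id and $status are the variables.
SUPPORT_EMAIL = Template(
    "Hello $name,\n\nYour ticket #$ticket_id is now $status.\n"
)


def render_support_email(name, ticket_id, status):
    """Bind values to the template variables and return the rendered text."""
    return SUPPORT_EMAIL.substitute(name=name, ticket_id=ticket_id, status=status)
```

Because substitute raises KeyError when a variable is left unbound, a missing field is caught before a half-filled email goes out.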

1. Interfaces
The most important concept in software is the interface. Any good software is a
model of a real (or imaginary) system. Understanding how to model the problem
in terms of correct and simple interfaces is crucial. Lots of systems suffer from
the extremes: clumped, lengthy code with little abstraction, or an overly designed
system with unnecessary complexity and unused code.

Among the many books, Agile Programming by Dr Robert Martin stands out
because of its focus on modeling correct interfaces.

In modeling, there are ways you can iterate towards the right solution. Firstly,
never add methods that might be useful in the future. Be minimalist, get away
with as little as possible. Secondly, don't be afraid to recognize today that what you did yesterday wasn't
right. Be willing to change things. Thirdly, be patient and enjoy the process. Ultimately you will arrive at
a system that feels right. Until then, keep iterating and don't settle.
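
A minimal interface might look like the sketch below, which revisits the popular-books cache from the caching section. The Cache, DictCache and popular_books names are invented for this example; the interface exposes only the two methods the caller needs today.

```python
from abc import ABC, abstractmethod


class Cache(ABC):
    """A deliberately small interface: just get and put."""

    @abstractmethod
    def get(self, key):
        """Return the cached value, or None if the key is absent."""

    @abstractmethod
    def put(self, key, value):
        """Store a value under the key."""


class DictCache(Cache):
    """Simplest possible implementation, backed by a dict."""

    def __init__(self):
        self._data = {}

    def get(self, key):
        return self._data.get(key)

    def put(self, key, value):
        self._data[key] = value


def popular_books(cache, compute):
    """Depend on the Cache interface only, never on a concrete class."""
    books = cache.get("popular_books")
    if books is None:
        books = compute()  # the costly step, e.g. a database query
        cache.put("popular_books", books)
    return books
```

Because popular_books only knows about Cache, the dict-backed version could later be swapped for a distributed one without touching the calling code.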

Conclusion
Modern software engineering is sophisticated and powerful, with decades of experience, millions of lines
of supporting code and unprecedented access to cloud computing. Today, just a couple of smart people
can create software that previously required the efforts of dozens of people. But a good craftsman still
needs to know what tools to use, when and why.

In this post we discussed concepts that are indispensable for software engineers. And now tell us please
what you would add to this list. Share with us what concepts you find indispensable in your daily software
engineering journeys.
