Autumn 2014

TIES456 Introduction to SOA and Cloud Computing, 5 ECTS

TIES532 - Service oriented architectures and cloud computing for developers, 5 ECTS

Today

Review of week 42

Review week 42 - overview - databases

All databases look ok
- Some had problem with the small disk size
I could not log-in to the DB machine of group 8 and 10

Review week 42 - web servers

All ok, except
- Group 5 : optimizations missing
- Group 8 : …
- Group 10: cannot log in :(
  - But seen in class …
You should re-use the bucket objects and the memcache client

Review week 42 - caching

Caches are hard to get right

              @post('/add')
              def add():
                  ... get name and number from form
                  ... add to database

              @get('search/<name>')
              def search(name):
                    number = mcClient.get(name)
                    if (number is None):
                        number = riakBucket.get(name)
                        mcClient.set(name, number)
                     return "Some html with the number"

Like this in groups 2, 3, 4, 6, 7, 9

Review week 42 - caching

Problem : stale data in cache in the following case:
1. /add?name=A&number=0
  - DB: (A,0)
  - MC: -
2. /search?name=A //returns 0
  - DB: (A,0)
  - MC: (A,0)
3. /add?name=A&number=1 //update to 1
  - DB: (A,1)
  - MC: (A,0)
4. /search?name=A //returns 0
  - DB: (A,1)
  - MC: (A,0)

Review week 42 - caching

Caches are hard to get right - let’s try to solve this - attempt 1

              @post('/add')
              def add():
                  ... get name and number from form
                  ... remove from memcache
                  ... add to database

              @get('search/<name>')
              def search(name):
                    number = mcClient.get(name)
                    if (number is None):
                        number = riakBucket.get(name)
                        mcClient.set(name, number)
                     return "Some html with the number"

Review week 42 - caching

Problem : stale data in cache in the following (multi-threaded) case:

/add?name=A&number=0
- DB: (A,0)
- MC: -

Add and search simultaneously

/add?name=A&number=1	/search?name=A	DB	MC
Get data from form		(A,0)	(A,0)
remove from memcache		(A,0)	(-,-)
	number = mcClient.get(name)	(A,0)	(-,-)
	if (number is None):	(A,0)	(-,-)
	number = riakBucket.get(name)	(A,0)	(-,-)
	mcClient.set(name, number)	(A,0)	(A,0)
add to database		(A,1)	(A,0)

DB: (A,1)
MC: (A,0)

/search?name=A //returns 0
- DB: (A,1)
- MC: (A,0)

Review week 42 - caching

Caches are hard to get right - let’s try to solve this - attempt 2

              @post('/add')
              def add():
                  ... get name and number from form
                  ... add to database
                  ... remove from memcache

              @get('search/<name>')
              def search(name):
                    number = mcClient.get(name)
                    if (number is None):
                        number = riakBucket.get(name)
                        mcClient.set(name, number)
                     return "Some html with the number"

Review week 42 - caching

Problem : stale data in cache in the following (multi-threaded) case:

/add?name=A&number=0
- DB: (A,0)
- MC: -

Add and search simultaneously

/add?name=A&number=1	/search?name=A	DB	MC
	number = mcClient.get(name)	(A,0)	(A,0)
	if (number is None):	(A,0)	(-,-)
	number = riakBucket.get(name)	(A,0)	(-,-)
	#number == 0	(A,0)	(-,-)
Get data from form		(A,0)	(-,-)
remove from memcache		(A,0)	(-,-)
add to database		(A,1)	(-,-)
	mcClient.set(name, number)	(A,1)	(A,0)

DB: (A,1)
MC: (A,0)

/search?name=A //returns 0
- DB: (A,1)
- MC: (A,0)

Review week 42 - caching

You could also do

              @post('/add')
              def add():
                  ... get name and number from form
                  ... add to database
                  ... set on memcache = overwrite

Group 11, 12

Review week 42 - caching - a problem can still occur

Problem : stale data in cache in the following (multi-threaded) case:

/add?name=A&number=0
- DB: (A,0)
- MC: -

Add and search simultaneously

/add?name=A&number=0	/search?name=A	DB	MC
	number = mcClient.get(name)	(A,0)	(A,0)
	if (number is None):	(A,0)	(-,-)
	number = riakBucket.get(name)	(A,0)	(-,-)
	#number == 0	(A,0)	(-,-)
Get data from form		(A,0)	(-,-)
set on memcache		(A,0)	(A,1)
add to database		(A,1)	(A,1)
	mcClient.set(name, number)	(A,1)	(A,0)

DB: (A,1)
MC: (A,0)

/search?name=A //returns 0
- DB: (A,1)
- MC: (A,0)

Review week 42 - caching

Caches are hard to get right - let’s try to solve this - attempt by group 5

              @post('/add')
              def add():
                  ... get name and number from form
                  ... add to database
                  ... remove from memcache

              @get('search/<name>')
              def search(name):
                    number = mcClient.get(name)
                    if (number is None):
                        number = riakBucket.get(name)
                        mcClient.cas(name, number) #only do a set if not updated
                     return "Some html with the number"

Review week 42 - caching

Problem : The teacher cannot find any guarantees on whether the CAS counter is updated on delete. If this is not the case, the following could happen: stale data in cache in the following (multi-threaded) case:

/add?name=A&number=1
- DB: (A,1)
- MC: (-,-)

Add and search simultaneously

/add?name=A&number=2	/search?name=A	DB	MC
	number = mcClient.get(name)	(A,1)	(-,-)
	if (number is None):	(A,1)	(-,-)
	number = riakBucket.get(name)	(A,1)	(-,-)
Get data from form		(A,1)	(-,-)
add to database		(A,2)	(-,-)
remove from memcache		(A,2)	(-,-)
	mcClient.cas(name, number) #case1	(A,2)	(A,1)
	mcClient.cas(name, number) #case2	(A,2)	(-,-)

case 1
- The counter is not updated on delete and null == null, then the cas will succeed!
- DB: (A,2)
- MC: (A,1)
1. /search?name=A //returns 1
  - DB: (A,2)
  - MC: (A,1)
case 2
- The counter is updated or null != null, then the cas will fail.
- In this case the problem seems solved.
- DB: (A,2)
- MC: (-,-)
1. /search?name=A //returns 2
  - DB: (A,2)
  - MC: (A,2)

Review week 42 - caching

Caches are hard to get right
- One feasible way seems to put a time-out on your memcache entries and time a request can take.
  - It might still be that old data is returned, but it should be rare and only for limited time.
- Another way is by using a compare-and-swap operation, where you only overwrite if the data is what you expect.
- If there are multiple memcache servers, they will not inform each other : remove from all
  - If this is a hot piece of data, your database will get be hit a lot of times with the same request

Thesis topics - caching

Caching using groupcache could solve some headaches
- Similar idea to cache from Guava
- The cache is reponsible for fetching missing data

Review week 42 - caching - another problem

In search

              if mc.get(name):
                 return mc[name]

              mc[name] = fetched.encoded_data
              return mc[name]

It might be that the data in memcache is removed in between the two calls.

Discussion about reflective questions

How would you implement a page which shows a list of contacts. See also List keys.
Why would you ever use a set-up with virtual machines in a real (production) environment? Or would you not?
Which of the optimizations made sense, which ones not?
What should be improved in this exercise if it is given to students in the future?

Listing keys

I would probably just stream the keys to the client using the provided method. I’m not sure what would be the better way to do it though this definitely isn’t a smart thing to do as the amount of keys can be quite high.

According to the Riak documentation using List keys is not feasible in production environment, because it causes the database to go through all the entries in the database. This is slow and requires too much resources to be practical. One way to implement a page that shows a list of files could be to use Riak’s secondary indexes. Secondary indexes are keywords that the database can use to narrow down the search. In file listing’s case one could use for example the file type as a secondary index key. Secondary key utilities also support range searches and for example pagination which would be handy if the amount of files is large. Naturally, one should use the streaming capabilities of Riak to get the keys.

Virtual machines in production

Benefits
- easy deployment (you can define templates)
- improve utilization rate by putting multiple VM on one real server - cost effective
- security (restrict access)
  - Isolated environment for testing things
  - Fine grained access control
- Some protection against hardware failure
- State can be saved in a file and the whole machine can be moved
- Fast ‘network’ connection between machines
  - You can try bad connections too
Drawback
- For the absolute performance virtual machines are not the best, but security wise they make a lot of sense.
  - HPC is never done using virtual machines
Note
- Your http server was only accesible from the host (oksa3), one can set-up port forwarding such that one port of oksa3 is forwarded to a specific port on the VM

Optimizations

The optimizations all seemed to be useful.
As seen in the table every optimization made sense, especially the cache and the change from CherryPy to Bootle improved the speed.

	CherryPy	Bottle
	HTTP	PBC	HTTP	PBC
	Cache	No Cache	Cache	No Cache	Cache	No Cache	Cache	No Cache
Netem delay (1000ms, 5% pckg loss)	2,9/sec	0,59/sec	4,7/sec	0,99/sec	1,2/sec	0,2/sec	2,0/sec	0,33/sec
No Netem delay	81,7/sec	59,0/sec	81,9/sec	72,5/sec	83,0/sec	53,7/sec	83,7/sec	57,0/sec

The observed change depends on the order the optimizations are applied.
The cache only makes sense if the network to the database is slow.

Improvements

Give everyone a chance to create VMs
Difference in background is still an issue
- Some pointers to fast introduction to python
How to setup memcached
How to debug my application
scp command could be helful for students who are not very familiar with Linux
Testing the load balancer when the cluster is running on multiple servers can be interesting to observe and implement.

Disagreement

There’s a possibility for SQL injection when giving the key and value values (add and search) but we trust that Riak is created in such a smart manner that they are dealt with internally (since we don’t input actual SQL).

The database is not published online and only able to touch on the localhost and with the other virtual machine. It’s not really practicle.

Next week - Cloud computing - Cloud Services

Week 43

Advanced assignments [TIES532]

Complete the on-line course CS169.1x: Software as a Service.
- Starts October 21. Choose “Audit This Course” to access.
- You need 70% to pass the TIES532 course.
- Individual.

This week

Before Thursday
- Read prerequisite material
On Thursdays and Mondays
- Make the assignment in groups
Before Monday (23:59)
- Submit assignment using git (tip of master branch) - see task.
Before Tuesday’s lecture
- Prepare you presentation