Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3
HOMEWORK: 1 WEEK
You are a software developer at Digital LTD, a software development
company wishing to provide a console application to download a web application as a service to the public citizen, and you are part of the tasked team of engineers. The download manager is a sub system that once provided the URL of a given website as input from the console, it goes on downloading the homepage of the website to the hard disk, and revisit the home page extracting all external links and download corresponding pages and folders to complete the whole website download operation to your local computer’s hard disk in your preferred directory location:
As a java expert, Your task on this project is to implement a console
application to downlaod whole web application. INSTRUCTIONS · Receive and validate the URL to the home page as a primary input to the system and create a folder of the same name as the domain name contained by that URL. · Download the home page pointed to by the URL and store it in its filename; if the name can not be extracted, name it as “index.html” and continue. · Visit the downloaded home page file by extracting all external links, record links in a relational DBMS and update both the Console with the list stored of extracted external links . · For each external link extracted from step 3., download a corresponding resource/file by showing the resource download progress report from both the console . The link download progress report to be printed is Described at the end of this document.
· At the end of the website download and in the future,
present a download completion report showing the name of the downloaded website, the download start and end date time for the whole website download. The report would also show each link and how many kilobytes were downloaded and in how much time in milliseconds did the link’s resource download take to complete. · For each downloaded website, the report produced in step 5 has to be stored in a relational DBMS of your choice, and can be presented again in future by querying the database by website’s name. · · If the website is made of different directories, downloaded files will have to be in their corresponding directories as from the remote server. · At list two database tables are required to complete this task:
-A website table to keep the summary of the download
report has at list 6 columns: (id,website_name,download_start_date_time,download_en d_date_time,total_elapsed_time,total_downloaded_kilobyt es) -A link table to keep records of external links has at list five columns: (id,link_name,website_id,total_elapsed_time,total_downloa ded_kilobytes)