Introduction to WWW/HTML

Welcome to the Introduction to WWW/HTML online class notes. A very similar tutorial has been given to IBM Rochester, University of Minnesota and a few companies in the Mid West. Please note that this is the master copy and will be updated from time to time.

 

Agenda

  1. Your first step to be a Webmaster ???
  2. Start from your PC
  3. From PC to Local Area Network (LAN)
  4. From LAN to the world (Internet)
  5. Building a part of the Web/WWW on top of the Internet
  6. What is WWW/HTML ? (Why do we Care ?)
  7. Basic Concept
  8. Getting Start on Different Platforms
  9. What Do We Do With Existing Docs ?
  10. Images and More

 

 

Introduction


Our Goal

Basic understanding of why WWW exist

Basic understanding of what the players are (browsers, network protocol, web server, etc)

Basic coding of HTML and Why it works !

 

 

Start from Your PC

The "Name" of the PC - My Computer

The "Languages" of the PC can talk - Network Neigborhood

The "Phones" available on the PC - Your Web Browser

 

 

From PC to LAN

Computer that link together in a local area (Using ethenet, token ring)

Computer that talk the same language (NETBEUI in Microsoft Network, IPX in NetWare, TCP/IP in the Internet)

Printer Sharing

 

 

From LAN to the Internet

Computer talks the Internet Language - TCP/IP

Work:

Home:

 

 

Building a part of Web on Top of the Internet

A killer app that talks a very friendly network language (HTTP)

Easy access to information

Easy exchange of information

User friendly "phone" - your browsers

We need the network language (protocol) that enable one another to talk

We need another language that create document

etc.

 

 

World Wide Web as an Internet Concept

As the information superhighway (Internet) glows exponentially over the years, customers of the superhighway demand a need for one tool, one program, one appli cation that will take it from being a much-hyped but difficult-to-use linking of com puters around the world to being a highly informative, highly usable database and communications tool. The spreadsheet was the killer app for PCs a long time ago, but so far the Net doesn't have one. Some have given a potential "killer app" status to a program called Mosaic, Netscape or Microsoft IE.

What the superhighway does have at this point is a killer concept called World Wide Web (WWW). The WWW provides the technology needed to offer a navigable, attractive interface for the Internet's vast sea of resources, in much the same way that the toolbar on a word processor screen obscures the intimidating codes that the program actually consists of. The most accurate terminology might be meta-interface-an interface that incorporates other interfaces. WWW is a concept, not a program, not a system, and not even a specific protocol. It is the concept that brings internet users together through different applications and interface to access infor mation from the internet.

Nowadays, advertisements and editorial content in many publications such as Newsweek, Time, The New York Time, and the Wall Street Journal now routinely use the Web's identifying scheme, the Universal Resource Locator, as a means for directing the user to further information. Small business, organizations, even US White House, and individuals all over the world also use the Web for communication, information, and interaction.

This intense interest in the Web is a result of the potential it offers for communication. Using the Web, individuals or organizations can instantaneously and continuously present hypermedia--text images, movies, and sound-to a GLOBAL audience. For Web developers, this information environment demands excellent, effective content development in order to rise above the information clutter. With so much information on the Web, only that which truly meets user needs well can survive and flourish.






Sample Links on the Web






The HTML Language

HTML is a subset of the SGML, or Standard Generalized Markup Language. HTML is now the "standard" code used in the internet to make documents readable across a variety of platforms and software. HTML operates through a series of codes placed within an ASCII document. These codes are translated by a "World Wide Web Client" such as Mosaic, Netscape, OS2 WWW Explorer or MacWeb into specific kinds of formats to be displayed on the screen, and on which the user can act (for example, send a mail).

HTML is a glowing language, it stays around and more complex it is becoming. Please refer to the lastest documentation for update.











Basic Concept











What an HTML Document Is

HTML documents are plain-text (also known as ASCII) files that can be created using any text editor (e.g., Emacs or vi on UNIX machines; BBEdit on a Macintosh; Notepad on a Windows machine). You can also use word-processing software if you remember to save your document as "text only with line breaks."









HTML Editors

Some WYSIWYG editors are available (e.g., HotMetal, which is available for several platforms or Adobe PageMill for Macintoshes). You may wish to try one of them after you learn some of the basics of HTML tagging. It is useful to know enough HTML to code a document before you determine the usefulness of a WYSIWYG editor.

If you haven't already selected your software, refer to an online listing of HTML editors (organized by platform) to help you in your search for appropriate software.









Getting Your Files on a Server

If you have access to a Web server at school or work, contact your webmaster (the individual who maintains the server) to s ee how you can get your files on the Web. If you do not have access to a server at work or school, check to see if your community operates a FreeNet, a community-based network that provides free access to the Internet. Lacking a FreeNet, you may need to contact a local Internet provider that will post your files on a server for a fee. (Check your local newspaper for advertise ments or with your Chamber of Commerce for the names of companies.)









Tags Explained

An element is a fundamental component of the structure of a text document. Some examples of elements are heads, tables, paragraphs, and lists. Think of it this way: you use HTML tags to mark the elements of a file for your browser. Elements can contain plain text, other elements, or both.

To denote the various elements in an HTML document, you use tags. HTML tags consist of a left angle bracket (<), a tag name, and a right angle bracket (>). Tags are usually paired (e.g., <H1> and </H1>) to start and end the tag instruction. The end tag looks just like the start tag except a slash (/) precedes the text within the brackets.

Some elements may include an attribute, which is additional information that is included inside the start tag. For example, you can specify the alignment of images (top, middle, or bottom) by including the appropriate attribute with the image source HTML code.

NOTE: HTML is not case sensitive. <title> is equivalent to <TITLE> or <TiTlE>.

Not all tags are supported by all World Wide Web browsers. If a browser does not support a tag, it (usually) just ignores it.









The Minimal HTML Document

Every HTML document should contain certain standard HTML tags. Each document consists of head and body text. The head contains the title, and the body contains the actual text that is made up of paragraphs, lists, and other elements. Browsers expect specific information because they are programmed according to HTML and SGML specifications.

Required elements are shown in this sample bare-bones document:

    <html>
    <head>
    <TITLE>A Simple HTML Example</TITLE>
    </head>
    <body>
    <H1>HTML is Easy To Learn</H1>
    <P>Welcome to the world of HTML.
    This is the first paragraph. While short it is
    still a paragraph!</P>
    <P>And this is the second paragraph.</P>
    </body>
    </html>

The required elements are the <html>, <head>, <title>, and <body> tags (and their corresponding end tags). Because you should include these tags in each file, you might want to create a template file with them. (Some browsers will format your HTML file correctly even if these tags are not included. But some browsers won't! So make sure to include them.)

 

What About HyperLink

Hyperlinks (Hypertext links) are why we can jump from one document to another

This is a absolute hyperlink
<
a href="http://www.yahoo.com">Using a Search Engine</a>

This is a relative hyperlink
<
a href="filename.htm">Linking to a local link</a>

 

The Other Kind of Links - Anchors

Build the marker for the anchors, the <A> Tag:

<A NAME="MyPagePart"> Click Me </A>

A link to an external marker:

<A HREF="filename.html#MyPagePart">Jump to an External Anchor</A>

A link to an internal marker:

<A HREF="#MyPagePart"> Jump to an Internal Anchor</A>

Example:

The following link will jump to the top of this page. At the top of the page, we define:
<h1 align="center"><a name="Top">Introduction to WWW/HTML</a></h1>

 

And the link above actually has the coding of
<
a href="#Top">link</font></a>

 

 

List, List, List

Order List - Number will show up:

<OL> ... </OL>

Unorder List - Bulleted List

<UL> ... </UL>

The element within List

<LI>

Definition List

<DL>

<DT>Term Name<DD> Term Definition

<DT>Term Name 2<DD> Term Definition 2

</DL>




A Teaching Tool

To see a copy of the file that your browser reads to generate the information in your current window, select View Source (or the equivalent) from the browser menu. The file contents, with all the HTML tags, are displayed in a new window.

This is an excellent way to see how HTML is used and to learn tips and constructs. Of course, the HTML might not be technically correct. Once you become familiar with HTML and check the many online and hard-copy references on the subject, you will learn to distinguish between "good" and "bad" HTML.

Remember that you can save a source file with the HTML codes and use it as a template for one of your Web pages or modify the format to suit your purposes.











Getting Start with Your Web Page









Getting Start in AIX

With an valid userid, e.g. ibmjoeuser. Sign on as ibmjoeuser and do the following steps to create a homepage under IBM Rochester:







Getting Start on OS/2

A list of files are available under "gump.watson.ibm.com(9.2.11.26)/pub/httpdos2/." The following is a sample list:

  README                    2 Kb    Tue Apr 11 00:00:00 1995 
  code10.zip              452 Kb    Tue Apr 11 00:00:00 1995 
  core                     29 Kb    Tue Oct  1 06:31:00 1996 
  misc/                             Thu Jun 15 00:00:00 1995 Directory
  ncsadoc.zip             549 Kb    Wed Mar  8 00:00:00 1995 
  patch10.zip             105 Kb    Tue Apr 11 00:00:00 1995 
  src10.zip                91 Kb    Tue Apr 11 00:00:00 1995 
  unzip.exe                90 Kb    Fri Feb 17 00:00:00 1995 

The following is the installation guide:

Quick Installation Guide

1. Copy the package file code10.zip to the root directory of your choice.
   HPFS drive is much preferred, but it will also work on a FAT drive.

2. You must use PKZIP 204g compatible decompression program.
   Do not use a DOS-mode archiver because HTTPD for OS/2 contains 
   long file names.  InfoUnzip for OS/2 works best.
   A copy of InfoUnzip for OS/2 can be found in the same place
   you found this package.

3. Unzip the package file with the -d switch so that the correct
   directory tree will be created.

     unzip -d code10.zip

4. CD to httpd, and start the server.

     httpd

5. Now start your favorite web browser, and off you go!

6. Optionally, you can install NCSA documentation online.  Copy ncsadoc.zip
   to the same root directory and issue unzip -d ncsadoc.zip.
   When unzip asks if you want to replace an "index.html" file, reply yes.

7. If your document root is on a HPFS drive, you may want to turn
   "FATTranslation" off in conf\srm.cf.  You can just comment out the line.

Note: After installation, your home page will be in c:\httpd\www\index.html





Getting Start on AS/400

Setting Up an HTTP Server On An AS/400

  1. Please make sure that TCP/IP is running.
  2.  
  3. Set Up the Welcome(Home) Page
    1. Create a library to hold http stuff (e.g. HTTP):
      CRTLIB LIB(HTTP)
    2. Create a source physical file to hold html documents (e.g. HTDOCS):
      CRTSRCPF FILE(HTTP/HTDOCS)
    3. The server acts as user QTMHHTTP, so this user must be given access to the HTDOCS file and its members:
      CHGAUT OBJ('/QSYS.LIB/HTTP.LIB/HTDOCS.FILE') USER(QTMHHTTP) DTAAUT(*RX) OBJAUT(*ALL)
    4. Use SEU to create and edit the WELCOME file:
      STRSEU SRCFILE(HTTP/HTDOCS) SRCMBR(WELCOME) TYPE(HTML)
    5. Enter HTML source into the file.
          <HTML>
      
        	<HEAD>
          	   <TITLE> Welcome to AS400 Home Page! </TITLE>
        	</HEAD>
      
        	<BODY>
          	   <H1 align=center>Hi.</H1>
        	</BODY>
      
          </HTML>
        
  4. Set Up the HTTP Server
    1. Use the http configuration editor:
      WRKHTTPCFG
    2. The following is a sample of the setup file:
      	Pass /* /QSYS.LIB/HTTP.LIB/HTDOCS.FILE/*
      	AlwaysWelcome On
      	AccessLog WWWAccess
      	ErrorLog WWWError
          
    3. Start the server:
      STRTCPSVR SERVER(*HTTP)
  5.  
  6. Browse the page using any web browser. (e.g. Go to URL: http://rchas69a/sinnhome.mbr)
  7.  
  8. If it does not work, go back and make sure all the above steps are followed. Now ... we can add some images
    1. Set up the directory
      1. For example, we can put all the images in directory /htdocs/images:
        MKDIR '/htdocs'
      2. Give QTMHHTTP access to that directory by the following command:
        CHGAUT OBJ('/htdocs') USER(QTMHHTTP) DTAAUT(*RX) OBJAUT(*ALL)
      3. Make the subdirectory:
        MKDIR '/htdocs/images'
    2. Get some pictures
      I got mine by ftp'ing some gif files from the AFS directory /usr/common/www/w3root/images
    3. Modify your home page
      The tag for an image looks like this: <IMG src="/images/filename.gif">
    4. Modify the http configuration file
      1. Start config editor:
        WRKHTTPCFG
      2. Add a Pass statement to the top of the file:
        Pass /images/* /htdocs/images/*
    5. Restart HTTP server:
      STRTCPSVR SERVER(*HTTP) RESTART(*HTTP)
  9.  
  10. Test it with your web browser again.
  11.  
  12. Let's say you want to add another web page to your machine:
    1. Use SEU to create and edit your new file:
      STRSEU SRCFILE(HTTP/HTDOCS) SRCMBR(SCISYS) TYPE(HTML)
    2. To add a link from your homepage to this file, use this tag:
      <A HREF="/scisys.mbr"> Other page </A>

More hints:


Getting Start on Windows

Use of Personal Web Server for Testing (Download from Microsoft)

Use of an Editor

Use of Web Browser

(Put it up on the server)

 






HTML Conversion

EZ to HTML

ez2html: Converts datastream files into WWW documents
What ez2html is
Ez2html reads any datastream file (with styles and insets, created by EZ), or plain ASCII file, or source code, converts that input into HTML+ tag format, and outputs the result to stdout.

"HTML+ tag format" is the markup used by documents on the World Wide Web.  See the  htmltext  help for more details.
Starting ez2html
To convert a file to HTML+, invoke the following command:

ez2html [-h] [-o dataobj] [infile]
-h Displays usage help.

-o dataobj Tells the converter which dataobject to use, in case the file extension is not mapped in /usr/andrew/lib/global.ezinit.  See the  ez-extensions  help for more info.

infile Name of file to read as input.  If omitted, stdin will be used.
Examples
ez2html -o ctext <~/bin/myshell> ~/WWW/myshell.html

BMU : Bookmaster to HTML

 1.1  Why the BM Utilities?

   The DynaMIT project is a travel behaviour simulation written in C++, and
   intended to be built upon by the author and others to perform various
   experiments on travel behaviour.  The framework must be maintainable and
   documentation is an essential part of that.

 

 2.1  Invoking bmu

 2.1.1  Command-Line Syntax

   You can invoke the bmu program with syntax like the following:

     bmu  fn<.ext>
             fn      filename of master input file
             ext     extension of master input file - default .scr
     Options:
             -f format
                     select output format: can be
                     html    format for WWW HTML browsing - the default
                     latex   format for LaTeX processing
             -m mainfn
                     specify main file - mainfn.toc will be used as toc input
        -od outdir
             specify output directory - default current directory

 

Others

Using export functions from Word Processer (MS Word, Lotus Word Pro, etc)

Using export functions from presentation Tools

Web Editing Tools





Inline Images and More

Most Web browsers can display inline images (that is, images next to text) that are in X Bitmap (XBM), GIF, or JPEG format. Other image formats are being incorporated into Web browsers [e.g., the Portable Network Graphic (PNG) format]. Each image takes time to process and slows down the initial display of a document. Carefully select your images and the number of images in a document.

To include an inline image, enter:

    <IMG SRC=ImageName>

where ImageName is the URL of the image file.

The syntax for <IMG SRC> URLs is identical to that used in an anchor HREF. If the image file is a GIF file, then the filename part of ImageName must end with .gif. Filenames of X Bitmap images must end with .xbm; JPEG image files must end with .jpg or .jpeg; and Portable Network Graphic files must end with .png.







Image Size Attributes

You should include two other attributes on <IMG> tags to tell your browser the size of the images it is downloading with the text. The HEIGHT and WIDTH attributes let your browser set aside the appropriate space (in pixels) for the images as it downloads the rest of the file. (Get the pixel size from your image-processing software, such as Adobe Photoshop.)

For example, to include a self portrait image in a file along with the portrait's dimensions, enter:

    <IMG SRC=SelfPortrait.gif HEIGHT=100 WIDTH=65>

NOTE: Some browsers use the HEIGHT and WIDTH attributes to stretch or shrink an image to fit into the allotted space when the image does not exactly match the attribute numbers. Not all browser developers think stretching/shrinking is a good idea. So don't plan on your readers having access to this feature. Check your dimensions and use the correct ones.







Aligning Images

You have some flexibility when displaying images. You can have images separated from text and aligned to the left or right or centered. Or you can have an image aligned with text. Try several possibilities to see how your information looks best.

Aligning Text with an Image
By default the bottom of an image is aligned with the following text, as shown in this paragraph. You can align images to the top or center of a paragraph using the ALIGN= attributes TOP and CENTER.

This text is aligned with the top of the image (<IMG SRC = "5.hkbut.gif" ALIGN=TOP>). Notice how the browser aligns only one line and then jumps to the bottom of the image for the rest of the text.

And this text is centered on the image (<IMG SRC = "5.hkbut.gif" ALIGN=CENTER>). Again, only one line of text is centered; the rest is below the image.

Images without Text
To display an image without any associated text (e.g., your organization's logo), make it a separate paragraph. Use the paragraph ALIGN= attribute to center the image or adjust it to the right side of the window as shown below:

<p ALIGN=CENTER>
<IMG SRC = "5.hkbut.gif">
</p>

which results in:

The image is centered; this paragraph starts below it and left justified.









Alternate Text for Images

Some World Wide Web browsers--primarily those that run on VT100 terminals--cannot display images. Some users turn off image loading even if their software can display images (especially if they are using a modem or have a slow connection). HTML provides a mechanism to tell readers what they are missing on your pages.

The ALT attribute lets you specify text to be displayed instead of an image. For example:

    <IMG SRC="UpArrow.gif" ALT="Up">

where UpArrow.gif is the picture of an upward pointing arrow. With graphics-capable viewers that have image-loading turned on, you see the up arrow graphic. With a VT100 browser or if image-loading is turned off, the word Up is shown in your window.

You should try to include alternate text for each image you use in your document, which is a courtesy for your readers.







Background Graphics

Newer versions of Web browsers can load an image and use it as a background when displaying a page. Some people like background images and some don't. In general, if you want to include a background, make sure your text can be read easily when displayed on top of the image.

Background images can be a texture (linen finished paper, for example) or an image of an object (a logo possibly). You create the background image as you do any image.

However you only have to create a small piece of the image. Using a feature called tiling, a browser takes the image and repeats it across and down to fill your browser window. In sum you generate one image, and the browser replicates it enough times to fill your window. This action is automatic when you use the background tag shown below.

The tag to include a background image is included in the <BODY> statement as an attribute:

<BODY BACKGROUND="filename.gif">









Background Color

By default browsers display text in black on a gray background. However, you can change both elements if you want. Some HTML authors select a background color and coordinate it with a change in the color of the text.

Always preview changes like this to make sure your pages are readable. (For example, many people find red text on a black background difficult to read!)

You change the color of text, links, visited links, and active links using attributes of the <BODY> tag. For example, enter:

<BODY BGCOLOR="#000000" TEXT="#FFFFFF" LINK="#9690CC">

This creates a window with a black background (BGCOLOR), white text (TEXT), and silvery hyperlinks (LINK).

The six-digit number and letter combinations represent colors by giving their RGB (red, green, blue) value. The six digits are actually three two-digit numbers in sequence, representing the amount of red, green, or blue as a hexadecimal value in the range 00-FF. For example, 000000 is black (no color at all), FF0000 is bright red, and FFFFFF is white (fully saturated with all three colors). These number and letter combinations are cryptic. Fortunately an online resource is available to help you track down the combinations that map to specific colors:









External Images, Sounds, and Animations

You may want to have an image open as a separate document when a user activates a link on either a word or a smaller, inline version of the image included in your document. This is called an external image, and it is useful if you do not wish to slow down the loading of the main document with large inline images.

To include a reference to an external image, enter:

    <A HREF="MyImage.gif">link anchor</A>

You can also use a smaller image as a link to a larger image. Enter:

     <A HREF="LargerImage.gif"><IMG SRC="SmallImage.gif"></A>

The reader sees the SmallImage.gif image and clicks on it to open the LargerImage.gif file.

Use the same syntax for links to external animations and sounds. The only difference is the file extension of the linked file. For example,

<A HREF="AdamsRib.mov">link anchor</A>

specifies a link to a QuickTime movie. Some common file types and their extensions are:

File Type Extension
plain text .txt
HTML document .html
GIF image .gif
TIFF image .tiff
X Bitmap image .xbm
JPEG image .jpg or .jpeg
PostScript file .ps
AIFF sound file .aiff
AU sound file .au
WAV sound file .wav
QuickTime movie .mov
MPEG movie .mpeg or .mpg

Keep in mind your intended audience and their access to software. Most UNIX workstations, for instance, cannot view QuickTime movies.




 

Lab

Create a Web Page just like the following picture:

Image 1
Image 2
Image 3

Note:

 

Copyright 1996-2001 OpenLoop Computing. All rights reserved.