[Resolved with inet control]internetreadfile

**srisa** · Apr 23rd, 2006, 05:28 AM

Hello,
I am using internetreadfile api to download the code of a webpage. I have set the buffer size to 1024. I am writing the each downloaded line to a textfile. Problem is the result is not the same for all executions of the program. Some times it is working fine. Sometimes, some additional characters are being added. Some times a part of the line is being printed to the file more than once. Is this expected or am I doing something wrong here?
Thank you.

**kazar** · Apr 24th, 2006, 08:49 AM

Is there a reason why you aren't using the Inet Control? It's a lot more reliable.

**shirazamod** · Apr 24th, 2006, 09:20 AM

Maybe srisa doesn't want to use a control.
I personally like the Inet control although it isn't always reliable

**srisa** · Apr 24th, 2006, 09:50 AM

If I am designing the application for windows : If windows is the OS being used , we can expect Internet Explorer to be used. When IE is used wininet.dll will be present on the user's system. If the application is downloaded from net, using api's will reduce the size of the application.
Sounds pretty hi-fi.
Well, real answer is I am learning VB. As part of it I am trying to use api's. Internetreadfile makes the entry here.

**shirazamod** · Apr 24th, 2006, 10:09 AM

That's a good enough reason.
Unfortunately I have no experience with this API so I can't help, sorry.

**kazar** · Apr 24th, 2006, 11:09 AM

The function below should download the entire file to a file of your choice, not matter what the buffer size you choose.

VB Code:

Option Explicit
Const INTERNET_OPEN_TYPE_DIRECT = 1
Const INTERNET_OPEN_TYPE_PROXY = 3
Const INTERNET_FLAG_RELOAD = &H80000000
 
Private Declare Function InternetOpen Lib "wininet" Alias "InternetOpenA" (ByVal sAgent As String, ByVal lAccessType As Long, ByVal sProxyName As String, ByVal sProxyBypass As String, ByVal lFlags As Long) As Long
Private Declare Function InternetCloseHandle Lib "wininet" (ByVal hInet As Long) As Integer
Private Declare Function InternetReadFile Lib "wininet" (ByVal hFile As Long, ByVal sBuffer As String, ByVal lNumBytesToRead As Long, lNumberOfBytesRead As Long) As Integer
Private Declare Function InternetOpenUrl Lib "wininet" Alias "InternetOpenUrlA" (ByVal hInternetSession As Long, ByVal lpszUrl As String, ByVal lpszHeaders As String, ByVal dwHeadersLength As Long, ByVal dwFlags As Long, ByVal dwContext As Long) As Long
Private Sub GetFile(sourcefile As String, destfile As String, buffersize As Long)
  
 
    Dim hOpen As Long, hFile As Long, sBuffer As String, Ret As Long, oldbuffer As String
    'Create a buffer for the file we're going to download
    sBuffer = Space(buffersize)
    'Create an internet connection
    hOpen = InternetOpen(App.EXEName, INTERNET_OPEN_TYPE_DIRECT, vbNullString, vbNullString, 0)
    'Open the url
    hFile = InternetOpenUrl(hOpen, sourcefile, vbNullString, ByVal 0&, INTERNET_FLAG_RELOAD, ByVal 0&)
    Open destfile For Output As #1
    
    Do
    oldbuffer = sBuffer
    InternetReadFile hFile, sBuffer, buffersize, Ret
    If sBuffer = oldbuffer Then GoTo cleanup
    Print #1, sBuffer
    
    Loop
    
cleanup:
    InternetCloseHandle hFile
    InternetCloseHandle hOpen
    Close
End Sub

Hope that works, ask if you have any queries

**srisa** · Apr 25th, 2006, 06:15 AM

Thank you.The function as such is working fine. But , how do I check for errors during download like connection timeout, network disconnection or such things. During one of the trial runs, session got timed out, result being incomplete file download.

**kazar** · Apr 25th, 2006, 08:33 AM

Erm...You could use the checkinternetconnection function. I'll check when i get back home, and post the fix.

**kazar** · Apr 25th, 2006, 12:11 PM

VB Code:

Option Explicit
Const INTERNET_OPEN_TYPE_DIRECT = 1
Const INTERNET_OPEN_TYPE_PROXY = 3
Const INTERNET_FLAG_RELOAD = &H80000000
Private Const FLAG_ICC_FORCE_CONNECTION = &H1
Private Declare Function InternetCheckConnection Lib "wininet.dll" Alias "InternetCheckConnectionA" (ByVal lpszUrl As String, ByVal dwFlags As Long, ByVal dwReserved As Long) As Long
 
Private Declare Function InternetOpen Lib "wininet" Alias "InternetOpenA" (ByVal sAgent As String, ByVal lAccessType As Long, ByVal sProxyName As String, ByVal sProxyBypass As String, ByVal lFlags As Long) As Long
Private Declare Function InternetCloseHandle Lib "wininet" (ByVal hInet As Long) As Integer
Private Declare Function InternetReadFile Lib "wininet" (ByVal hFile As Long, ByVal sBuffer As String, ByVal lNumBytesToRead As Long, lNumberOfBytesRead As Long) As Integer
Private Declare Function InternetOpenUrl Lib "wininet" Alias "InternetOpenUrlA" (ByVal hInternetSession As Long, ByVal lpszUrl As String, ByVal lpszHeaders As String, ByVal dwHeadersLength As Long, ByVal dwFlags As Long, ByVal dwContext As Long) As Long
Private Sub GetFile(sourcefile As String, destfile As String, buffersize As Long)
  
 
    Dim hOpen As Long, hFile As Long, sBuffer As String, Ret As Long, oldbuffer As String
    'Create a buffer for the file we're going to download
    sBuffer = Space(buffersize)
    'Create an internet connection
    hOpen = InternetOpen(App.EXEName, INTERNET_OPEN_TYPE_DIRECT, vbNullString, vbNullString, 0)
    'Open the url
    hFile = InternetOpenUrl(hOpen, sourcefile, vbNullString, ByVal 0&, INTERNET_FLAG_RELOAD, ByVal 0&)
    Open destfile For Output As #1
    
    Do
         If InternetCheckConnection(sourcefile, FLAG_ICC_FORCE_CONNECTION, 0) = False Then
               Do Until InternetCheckConnection(sourcefile, FLAG_ICC_FORCE_CONNECTION, 0) = True
               Loop
         End If
    oldbuffer = sBuffer
    InternetReadFile hFile, sBuffer, buffersize, Ret
    If sBuffer = oldbuffer Then GoTo cleanup
    Print #1, sBuffer
    
    Loop
    
cleanup:
    InternetCloseHandle hFile
    InternetCloseHandle hOpen
    Close
End Sub

Right then, that should, if it can't connect to the desired page, for whatever reason, it will hang until it can. Then it will continue download.

Hope thats what you're looking for.

**shirazamod** · Apr 25th, 2006, 12:34 PM

You should use DoEvents within it and add a timeout incase the user isn't connected to the net

**kazar** · Apr 25th, 2006, 12:47 PM

First off, it's only rough code ¬_¬

Secondly, he's right, so here you go -

VB Code:

Option Explicit
Const INTERNET_OPEN_TYPE_DIRECT = 1
Const INTERNET_OPEN_TYPE_PROXY = 3
Const INTERNET_FLAG_RELOAD = &H80000000
Private Const FLAG_ICC_FORCE_CONNECTION = &H1
Private Declare Function InternetCheckConnection Lib "wininet.dll" Alias "InternetCheckConnectionA" (ByVal lpszUrl As String, ByVal dwFlags As Long, ByVal dwReserved As Long) As Long
 
Private Declare Function InternetOpen Lib "wininet" Alias "InternetOpenA" (ByVal sAgent As String, ByVal lAccessType As Long, ByVal sProxyName As String, ByVal sProxyBypass As String, ByVal lFlags As Long) As Long
Private Declare Function InternetCloseHandle Lib "wininet" (ByVal hInet As Long) As Integer
Private Declare Function InternetReadFile Lib "wininet" (ByVal hFile As Long, ByVal sBuffer As String, ByVal lNumBytesToRead As Long, lNumberOfBytesRead As Long) As Integer
Private Declare Function InternetOpenUrl Lib "wininet" Alias "InternetOpenUrlA" (ByVal hInternetSession As Long, ByVal lpszUrl As String, ByVal lpszHeaders As String, ByVal dwHeadersLength As Long, ByVal dwFlags As Long, ByVal dwContext As Long) As Long
Public timeoutcount as long
Private Sub GetFile(sourcefile As String, destfile As String, buffersize As Long, timeoutlevel as long)
 
 
 
    Dim hOpen As Long, hFile As Long, sBuffer As String, Ret As Long, oldbuffer As String
    'Create a buffer for the file we're going to download
    sBuffer = Space(buffersize)
    'Create an internet connection
    hOpen = InternetOpen(App.EXEName, INTERNET_OPEN_TYPE_DIRECT, vbNullString, vbNullString, 0)
    'Open the url
    hFile = InternetOpenUrl(hOpen, sourcefile, vbNullString, ByVal 0&, INTERNET_FLAG_RELOAD, ByVal 0&)
    Open destfile For Output As #1
    
    Do
         If InternetCheckConnection(sourcefile, FLAG_ICC_FORCE_CONNECTION, 0) = False Then
 
timeoutcount = 0
timeouttimer.enabled = True
timeouttimer.interval = 100
 
               Do Until InternetCheckConnection(sourcefile, FLAG_ICC_FORCE_CONNECTION, 0) = True
               Doevents
If timeoutcount = timeoutlevel then
        msgbox("Connection Timed Out")
        timeouttimer.enabled = False
        Exit Sub
End If
               Loop
timeouttimer.enabled = False
         End If
    oldbuffer = sBuffer
    InternetReadFile hFile, sBuffer, buffersize, Ret
    If sBuffer = oldbuffer Then GoTo cleanup
    Print #1, sBuffer
    
    Loop
    
cleanup:
    InternetCloseHandle hFile
    InternetCloseHandle hOpen
    Close
End Sub
 
Sub timeouttimer_Timer()
 
      timeoutcount = timeoutcount + 1
 
End sub

That should, if it has to wait a set length of time for a connection, popup a timeout message and exit sub. You have to make a timer and call it timeouttimer, though.

**shirazamod** · Apr 25th, 2006, 01:07 PM

Looks good [for rough code]

**kazar** · Apr 25th, 2006, 01:10 PM

Hmm, though i still think i prefer inet, at least it always pauses your code until it's done

**srisa** · Apr 26th, 2006, 10:18 AM

Well, before using internetopen, I am checking the connection using internetcheckconnection. If connection could be established start downloading , otherwise no. To check if the entire file is downloaded, I am storing the entire file in a string variable and using the instr function to see if </html> tag is present. If it is present then entire file has been downloaded otherwise no. Is this approach ok or am I overlooking or missing something here? And , thanks for the interest. Earlier I have posted three or four times in api section but this is the first time more than one person has taken time to post.

**kazar** · Apr 26th, 2006, 11:43 AM

That should do it fine.

**srisa** · Apr 27th, 2006, 10:00 AM

Time now for the specifics. I came across a project : at music.yahoo.com site , there will be list of videos alphabetically separated. What he wants is a list of all the videos listed there in this format : artistname | video name | a number. Number : when we hover the mouse over the video name, in the status bar, javascript : playvideo(23457181) , some such number will be displayed. This is the number that is needed. Artist name and videoname are hyperlinks. What I am doing is downloading the entire file and searching for these strings
search1 = "http://music.yahoo.com/ar-"
search2 = "javascript : playVideos"
and trying to retrieve the values. Is this the right approach? If you want I will upload the project that I have written.

**kazar** · Apr 27th, 2006, 12:30 PM

meaning you want to retrieve whatever comes after that value, until...

**srisa** · Apr 28th, 2006, 07:17 AM

This is what the a tag for artist looks like , from this the value between the greater than and less than signs ( inner text ) is what I retrieve :
<a href="http://music.yahoo.com/ar-298276---The-International-Noise-Conspiracy" title="The (International) Noise Conspiracy">The (International) Noise Conspiracy</a>.
This is what it looks like for javascript thing:
<a href="javascript

layVideos(2153097)" class="listheader" title="Reproduction Of Death">Reproduction Of Death</a>
The value between parantheses gives the number and the value between > and < signs ( inner text again ) gives the album title. I am instr function to retrieve the positions of the symbols and then mid$ function to get the value I want.

**kazar** · Apr 28th, 2006, 07:22 AM

<a href="http://music.yahoo.com/ar-298276---The-International-Noise-Conspiracy" title="The (International) Noise Conspiracy">The (International) Noise Conspiracy</a>.
This is what it looks like for javascript thing:
<a href="javascriptlayVideos(2153097)" class="listheader" title="Reproduction Of Death">Reproduction Of Death</a>

If the name of album/song is the same for both, are the bold numbers both the same.

**srisa** · Apr 28th, 2006, 07:28 AM

The number after ar is the artist number which is being used by the programmer for easy reference to the artist. First value will be the artist name. Value between parantheses is the number, last value will be the title. This string http://music.yahoo.com/ar is common for all the hyperlinks refering to the artists. So I am searching for that text to get the artist's name. The number is parantheses might be some sort of reference to the video in the database or something like that.

**srisa** · May 6th, 2006, 11:35 AM

Sorry for going into hibernation for so long. Extracting the string part is ok. But the problem is the downloaded content is not coherent. The code is not in order and some of the lines appear two times etc. I changed the extension of the file to html and opened it in IE. The page was not similar to the page downloaded.There were some errors.
May be using internetreadfile is complicated. Is there any way to know if the downloaded file is in order.

**kazar** · May 6th, 2006, 12:15 PM

use the inet control.

VB Code:

filestr = Inet1.OpenURL("address")
 
Open "C:\savefile.html" for output as #1
    Print #1, filestr
Close

**srisa** · May 8th, 2006, 11:06 AM

When I use the above only part of the file is being read.
I did this and it is working fine.

VB Code:

Private Sub cmdget_Click()
Inet1.URL = surl
Inet1.Execute
End Sub
 
 
Private Sub Form_Load()
Inet1.Protocol = icHTTP
End Sub
 
Private Sub Inet1_StateChanged(ByVal State As Integer)
 
Static i As Integer
Dim stemp As String
Dim bdone As Boolean
Dim fh As Long
 
stemp = vbNullString
bdone = False
 
Select Case State
Case icError
  MsgBox ("An error occurred while downloading the information")
 
Case icConnected
  Form1.Caption = "Connected to the site"
  
Case icConnecting
  Form1.Caption = "connecting to the site"
 
Case icDisconnected
  MsgBox ("Disconnected from the site")
  
Case icDisconnecting
  Form1.Caption = "disconnecting from the site"
Case icHostResolved
  Form1.Caption = "host resolved"
Case icNone
  Form1.Caption = "no message"
Case icRequesting
  Form1.Caption = "requesting"
Case icRequestSent
  Form1.Caption = "request sent"
Case icResolvingHost
  Form1.Caption = "resolving host"
  
Case icResponseCompleted
  stemp = Inet1.GetChunk(1024, icString)
  DoEvents
  
  Do While Not bdone
    sdata = sdata & stemp
    stemp = Inet1.GetChunk(1024, icString)
    DoEvents
    If Len(stemp) = 0 Then
      bdone = True
    End If
  Loop
  MsgBox ("Received the response in full")
  Text1.Text = sdata
  fh = FreeFile
  Open App.Path & "\musicvideo.txt" For Output As fh
  Print #fh, sdata
  Close #fh
 
Case icResponseReceived
  i = i + 1
  Form1.Caption = "received the response " & CStr(i)
 
Case icReceivingResponse
  Form1.Caption = "Receiving response"
End Select
End Sub

Would you suggest anything else for still better performance?

**manavo11** · May 14th, 2006, 04:16 PM

What do you mean still better performance? To read/download the file faster?

**srisa** · May 15th, 2006, 08:17 AM

Yes . Faster download and to see if any part of code is not necessary. And if you have patience to start all over again with internetreadfile: Why is it that it gets the data with duplications and other distortions?

**manavo11** · May 15th, 2006, 04:23 PM

Sorry, but I have no idea why the downloaded content wouldn't match the original source... It shouldn't since it just downloads the html code from what I understand, right? So after that, opening the downloaded data in a browser it should be identical?

As for speed, the only restriction should be your internet connection. There is no way to speed that up at all...

**srisa** · May 16th, 2006, 07:15 AM

I am attaching the zipped project. In that google.com homepage is downloaded using both the internetreadfile and inet. Result from each will be written a separate file with names usingapi.html and usinginet.html.
When I open them in IE. Result from Inet is more coherent and nearer to the actual page when compared with api one. Size of the file is small so there aren't many discrepancies. As the filesize grows so do the distortions.
Please have a look at the project and let me know if I am doing something wrong with internetreadile method.

**manavo11** · May 25th, 2006, 07:06 AM

Could it be that the site uses CSS and the css file isn't downloaded with the html so when you run it locally it looks wrong?

**kazar** · May 25th, 2006, 07:11 AM

true, but that doesn't explain the discrepancies between api and inet

**manavo11** · May 25th, 2006, 08:22 AM

Originally Posted by kazar

true, but that doesn't explain the discrepancies between api and inet

Good point, both should have to be equally wrong

I'm out of ideas

**srisa** · May 25th, 2006, 09:58 AM

Ok, we don't seem to be making much headway with this thread. I will mark it , "resolved with inet". If I come to know why internetreadfile is giving problems , I will update this thread.

**srisa** · May 25th, 2006, 10:43 AM

I have to admit that I have been lazy. Using google I found this bit of code, which looks like vbscript. I made few changes so that it confirms to vb.
Here is the code

VB Code:

Dim hopen As Long, hfile As Long, bytesread As Long
Dim buffersize As Integer
Dim sbuffer As String, sresult As String
Dim bOk As Boolean
Static count As Integer
       
buffersize = 1024
    'Create an internet connection
hopen = InternetOpen(App.EXEName, INTERNET_OPEN_TYPE_DIRECT, vbNullString, vbNullString, 0)
    'Open the url
hfile = InternetOpenUrl(hopen, surl, vbNullString, ByVal 0&, INTERNET_FLAG_RELOAD, ByVal 0&)
    'Open destfile For Output As #1
    
bOk = True
sresult = vbNullString
 
Do While bOk
  sbuffer = Space(buffersize)
  bytesread = 0
  bOk = InternetReadFile(hfile, sbuffer, buffersize, bytesread)
  If (bytesread > 0) Then
    sresult = sresult & Left$(sbuffer, bytesread)
  End If
  bOk = ((bOk = True) And (bytesread > 0))
  DoEvents
  count = count + 1
  Form1.Caption = CStr(count)
  
Loop
 
txtdata.Text = sresult
 
fh = FreeFile
Open App.Path & "\usingapi.html" For Output As fh
Print #fh, sresult
Close #fh
 
InternetCloseHandle hfile
InternetCloseHandle hopen
MsgBox "Download complete", vbOKOnly + vbInformation

And this is the link.
I didn't find any difference in downloaded data using this code and Inet control. I would be glad if you can confirm it.

Thread: [Resolved with inet control]internetreadfile

Thread Tools

Display

[Resolved with inet control]internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: internetreadfile

Re: [Resolved with inet control]internetreadfile

Posting Permissions